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PREFACE 

The author has attempted to illustrate in a somewhat 
brief manner the application of certain statistical techniques 
to the analysis of core sampling data. The statistical areas 
of frequency distributions, analysis of variances, and to a 
lesser degree, sampling, provide the basis for the study. 

A convenient reference system is used in the thesis. All 
equations are numbered as (Chapter, Number). Thus, equation 
(2-4) signifies the fourth equation of the second chapter. 

The notes are listed together before the bibliography and 
are numbered consecutively within each chapter. All mathemati- 
cal notations and significant terms used are listed and 
defined in Appendix A for ready reference. 

Actual field data v/as obtained for the study. The 
author is Indebted to the following people v/ho provided the 
field data upon which the project was based: Mr. Mack C. 

Colt and Mr. Wendell Weatherby of lola, Kansas; Mr. Schermerhorn 
of Tulsa, Oklahoma; Mr. Ray Plummer of Chanute, Kansas; and 
Mr. Carl Pate and the Oil Field Research Laboratory of 
Chanute, Kansas. 

The author wishes to express his appreciation to Dr. 

Charles F. Weinaug for his overall direction of the graduate 
program which led to the thesis, and to the Bureau of Supplies 
and Accounts, United States Navy, whose sponsorship made the 
thesis possible. 

The author is most deeply indebted to Dr. Floyd Preston 
whose close personal guidance and wise counsel were instru- 
mental in bringing the thesis to completion. 
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CHAPTER I 
INTRODUCTION 

Background 

Statistics may be considered in two senses. One con- 
ception of statistics is that of a collection of numerical 
or quantitative data, i.e., figure data, such as numerical 
data on births or unemployment. Statistics in the second 
sense is less well knovm, and refers to the techniques of 
analyzing data for decision-making. It may be thought of as 
the science of decision-making in the face of uncertainty. 

The application of statistics in this second sense to 
petroleum engineering problems is the purpose of this thesis. 
The petroleum engineer by the very nature of the realm in 
which he must operate is continually faced with making de- 
cisions based upon fragmentary and inconclusive information 
concerning the sub-surface of the earth. Because of the 
extreme complexity of the geological processes of erosion, 
material transport, deposition and burial of material, the 
porous media forming petroleum reservoirs are extremely 
heterogeneous. Physical properties can vary extensively from 
place to place within individual reservoirs. The extreme 
cost of sampling these reservoirs forces the engineer to 
construct mental and mathematical models from fragmentary 
information. Mathematical ^statistics would seem to provide 
a valuable tool for such model construction. The utilization 
of the science of statistics to provide systematic analysis 



I 



I 




2 



techniques to data that is available may provide additional 
relevant information as an assist to the decision-making process. 

It is the purpose of this thesis to test the applicability 
of certain statistical techniques for creating mathematical 
models of property variation xvithln petroleum reservoirs and 
to show how certain statistical techniques can be used to 
extend the Interpretation of fragmentary data. The study is 
limited to the analysis of core samples. Statistical techni- 
ques are applied to actual field data obtained from five 
fields. Specific results are presented to demonstrate calcula- 
tional techniques. 

The first approach in the study is to examine the 
probable frequency distributions of the various properties, 
testing to see if property distributions within individual 
fields satisfy the Gaussian normal distribution and if not, 
to describe the distributions by a family of generalized 
frequency curves knovm. as the Pearson system of frequency 
curves. An initial study of the distributions is considered 
essential, prior to the application of the other statistical 
techniques such as sampling and analysis of variances, since 
these depend upon the nature of the frequency distribution 
of the data under study. 

Origin of Statistics 

The origin of the modern science of statistics may be 
traced to the mid-seventeenth century \7hen two astute French 
mathematicians, Pascal and Fermot, v/ere presented with a 
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problem involving a game of chance and the interpretation 
of the probabilities associated xvith it. Their work led to 
solutions, not only of the problems proposed, but of more 
general ones. The methods employed by Pascal may be said 
to represent the beginning of the mathematics of probability, 
about which modern statistical theory centers today. The 
publication by Laplace in l8l2 of the epoch-making "Theorie 
Analytigue des Probabilities" laid a firro foundation for 
this theory.^ 

Statisticians in general from Pascal onward sought a 

method of describing the nature of the distributions of 

2 

chance effects. Table I shows the most notable of the 
methods developed to portray chance effects from the time 
of Pascal until Karl Pearson set forth his system of generalized 
frequency curves in 1895. 

The usual purpose of frequency distributions is to 
represent a sample of actual data drawn from a much larger 
or even infinite population. Even though a sample may be 
composed of a relatively small finite number of observa- 
tions, it may be reasonably representative of the larger 
universe from which it was dravm. Since it is virtually 
impossible to measure all the items comprising a universe, 
it is necessary to form a notion of the larger group from 
the study of a sample. Thus by constructing a hypothetical 
infinite population of vjhlch the' actual data is regarded as 
constituting a random sample, an understanding of the law 
of distribution of chance effects of the hypothetical 



5 



population may be obtained and specified by a few parameters. 

For example, to gain an understanding of the porosity 
characteristics of an area, it may be considered that the 
earth contains an infinite number of porosity measurements 
where it is impossible to measure each and every one of the 
porosities. By fitting a curve to a frequency distribution 
of the actual data obtained from a core sample, it is 
attempted to describe ^vhat appears to be the general form 
of the curve for the entire porosity population. 

For statistical work the normal or Gaussian curve is 
probably the best knovm and most heavily relied upon 
frequency distribution of those shovm in Table I, particularly 
in the theory of sampling. However, this distribution 
function does not apply to skewed frequency distributions, 
although numerous sets of data defy the normal curve and 
exhibit markedly skewed distributions. The Gaussian school 
of statisticians regarded skewness as a by-product of 
sampling and believed that skewness could be made to disappear 
completely if an infinite number of observations were available. 

With the recognition that the normal curve was not 
sufficient to characterize all natural observations, it 
became apparent that it was necessary either to devise 
methods of describing the most conspicuous departures from 
the normal distribution or to devise generalized frequency 
curves to describe distributions as they actually exist in 
the observational sphere. 

Karl Pearson followed the latter course and showed that 
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a set of frequency curves could be obtained by assigning 
values to the parameters in a certain first order differential 
equation which has its basis in the theory of probability. 

This approach is covered in Chapter II and the application 
of these curves to the actual field data is shown in Chapter III. 

Core Samples 

The taking of core samples from a reservoir has been an 
accepted practice for the past hundred years. At first, v/ell 
samples and cores had but one purpose — to locate oil."^ Thus 
it v;as necessary to take a sample from every well drilled. 

Today, hov;ever, with the advent of other techniques such as 
electric logging to aid in locating the oil the primary 
reason for taking samples has shifted to serve as a source 
of information about the reservoir and its contents. At 
present, core samples provide certain numerical parameters 
by which the field may be described. The most common is 
the arithmetic or weighted average of the various properties. 

The range and variances of the distribution of the properties 
are other easily obtained and useful parameters describing 
a field. 

A simple histogram showing the distribution presents 
visually the characteristics of a field. A cumulative 
frequency curve on graph paper will present the distribution 
and allow easy determination of such parameters as the median 
and possibly the mode. Prom a cumulative frequency curve an 
estimate of the percentage of a distribution which is above 
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a specified minimum point may be obtained. The range of a 
variable v/ithin any set quartile is likewise easily deter- 
mined. In summary, graphical methods of statistical data 
presentation permit certain numerical parameters to be 
obtained v;ith relative ease. 

Since it is no longer necessary to take samples from 
every v;ell, it is desired to determine the number of wells 
that should be core-sampled to provide information needed 
with an acceptable probability of obtaining reliable results. 

At present the number of core samples to be taken is deter- 
mined somewhat intuitively with a wide variation of opinion 
as to what is the necessary number. Are there statistical 
techniques available to serve as a guideline in determining 
how much information is needed and how such data should be 
interpreted? 

The method of interpretation of core data may be paramount 
since raw data by itself, Irregardless of the amount available, 
often supplies much information that is Irrelevant and 
immaterial. It is the obj’ect of statistical processes 
employed to exclude this irrelevant information and to 
isolate the whole of the relevant Information contained in 
the data. 

Data and Techniques 

The science of statistics consists of (l) collecting, 

( 2 ) presenting, and (5) analyzing quantitative data. The 
data used for this study consist of the physical properties 
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of oil fields, namely permeability (K), porosity (0), oil 

saturation (S^), and water saturation as obtained from 

4 

core samples taken from five different fields. Table II 
shows the type and amount of data considered. The five fields 
will be referred to as Field 1, Field 2, Field 3# Field 4, 
and Field 5* The numbering system has no significance other 
than the fields being numbered in sequence as data v;ere 
obtained for this study. 

Fields 1, 2, 4, and 5 are located in southeast Kansas 
while Field 3 is in northeast Oklahoma. Field 1 is in 
County "A" while Fields 2, and 5 are in County "B". 

The core samples for all five fields were taken and 
analyzed by the same laboratory with the same coring and 
analyzing techniques used for all fields. Thus even though 
there may have been errors made in arriving at the absolute 
values of measurements of the different properties, especially 
with regard to the fluid saturations, the errors may be 
considered consistent, and a relative comparison of the 
data may be made with a certain degree of confidence. 

TABLE II Summary of Core Sampling Data 

Field Wells No. of K No. of 0 No. of S No. of S 

Samples Samples Samples® Samples^^ 
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40 


1303 


794 


794 


794 
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14 


630 


345 


345 


345 
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7 


195 


132 


132 


132 


4 


19 


- 


316 


316 


201 


5 


101 


- 


1673 


1672 


1129 
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standard coring techniques v;ere employed in talcing the 
vertical cores of approximately 20 feet in length. Measure- 
ments of each property viere made approximately every six 
inches in the oil productive section \-ri.th approximately 
tvrenty samples being obtained for each well. The depth to 
the pay zone for the different fields varied somev/hat, but 
they all could be considered as shallow fields vjith pay 
zones at depths between six hundred and a thousand feet. 

The well spacing in most instances was approximately 
four hundred feet. In each field more than 50 percent of 
the wells were cored, with coring data available for all 
properties with the exception of permeabilities for Fields 
4 and 5- 

A map of each of the first three fields showing the 
locations of the wells cored is given in Appendix C. 

The above mentioned data were used for the following 
statistical studies v/hich constitute both the method of and 
justification for this thesis: 

1. Analytical Pitting of Data to the Pearson Generalized 
Frequency Curves. The study includes the technique for 
selecting the appropriate Pearson type curve, for 
fitting the data to the selected curve and for measuring 
the goodness of fit of the data to the curve. In addition 
the data X'fere fitted to the normal or Gaussian Curve 
and its goodness of fit determined. Permeability data 
were converted to logrithms and the resultant distribu- 
tions were analyzed by the above methods. 



2. An Analysis of Variance Study of Well Property Means 
and Variances. 

5 . Use of Certain Sampling Methods as a VJay of Estimating 
Certain Population Parameters from Point Estimates. 

The Application of Additional Statistical Techniques to 
Core Analysis Are Briefly Discussed. 

Statistics, in its many ramifications, is an exceedingly 
complex subject and much too involved to be thoroughly 
covered in a paper of this nature. The techniques presented 
are by no means the only methods of analysis available. In 
order to permit the reader without a thorough background in 
theoretical statistics to gain an understanding of the 
material presented, the discussion of the theoretical proofs 
and principles involved have been held to a minimum. 
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CHAPTER II 
FREQUENCY CURVES 

Introduction 

One of the most Important practical problems In mathe- 
matical statistics Is the obtaining of a relatively simple 
yet accurate representation of the frequency distribution of 
any set of data under consideration. Some knowledge of the 
frequency distribution of a set of data should be obtained 
before a statistical analysis Is attempted. 

There are essentially three methods of describing 
frequency distributions of one variable; namely, the graphi- 
cal method, the method of averages and dispersions, and the 
method of theoretical frequency functions or curves. These 
three methods of describing frequency curves will be briefly 
compared to their relative merits. 

The graphical method allov/s a large amount of data to 
be condensed to an easily presentable form. An Inherent 
v;eakness of this method Is the Inability to quantitatively 
compare distributions of two or more sets of data. One may 
state that two distributions are somewhat the same or that 
they are somev/hat different, but the degree of sameness or 
difference cannot be quantified by observation alone. This 
lack of numerical description of the distribution by the 
graphical method precludes Its use as a comparing method 
except In the most elementary studies. 

Figure 1 shows txiro histograms, one representing the 
frequency distribution of the porosity of Field 1 and uhe 
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The second method. Involving the use of averages and 
dispersion, does give a numerical description of the data 
in Figure 1, but it does not give a functional relation 
between the values of the variable X and the corresponding 
frequencies. The mimerical description in terms of X and 
<7 V7here: 



X = Arithmetic mean 
a = Standard deviation 



Field X a 

1 19. Wo 1.46 

2 19.7^ 1.17 
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This numerical description would further indicate that the 
distributions are nearly the same which again would be some- 
what misleading. 

The third method, an analytical method of describing 
frequency distributions shov;s that Field 1 is a generalized 
frequency distribution of the Pearson Type Ig, whereas Field 
2 is of the Pearson Type and may be described by the 
parameters = .729, 5 = -.294, and = 1.86, 6 = .l4l, 
respectively. This method indicates that the distributions 
of the tv/o fields are not similar which is contrary to v;hat 
one iTOuld assume by use of the first two methods of describ- 
ing frequency distributions. 

The reader should not be unduly concerned at this point 

2 

as to how the values of cu and 6 v;ere determined and what 

5 

Type Ig and IVg mean as the following sections T^ill present 
a thorough discussion of Pearson's generalized frequency curves. 



Pearson's Generalized Frequency Curves 



After it was recognized that the Gaussian or normal 



curve failed to describe the distribution of many of the 
observed data, Pearson proceeded to develop generalized 
frequency curves that could characterize the various types 
of unimodal frequency distributions encountered. 

In deciding on a system of curves for describing 
frequency distributions, it \^as realized that: 

1. Any expression used should be a graduation formula, i.e,, 
it must remove the roughness of the data. 

2. An expression, should not involve too many high moments 
to calculate constants, for thereby accuracy is reduced. 

3- There should be a systematic method of analysis applicable 
to all possible types of frequency distributions. 



Then, considering the most obvious characteristics of frequency 
distributions, it may be considered that they generally start 
at zero, rise to maximum, and then fall at the same or often 
at a different rate. At the end of the distribution there is 
often high contact. Mathematically, the above implies that 
a series of equations Y = f(x) or Y = f(t), must be chosen 
so that in each equation of the series dy/dx or dy/dt = 0 for 
certain values of x or t, namely at the maximum and at the 
end of the curve where there is contact with the axis of x or t. 

The above suggests that the frequency function may be 
represented as a solution 'of the differential equation: 



( 2 - 1 ) 

since: 



dt f (t) 



a. For a value of t, t = a,, dy/dt = 0 and the required 
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maximum is obtained and, 

b. As Y approaches zero, the derivative dy/dt also approaches 
zero thus giving contact at one end of the curve. 



Assuming that f (t) may be expanded in a converging 
power series, equation (2-1) may be iAn:‘itten: ^ 



( 2 - 2 ) 



1_ ^ a - t 

Y dt “ b^' + b-i t + b^ . 2 + 
o 1 2 t 



where the mean of the distribution is taken as the origin, 
and the abscissae’ are measured in units of the standard 
deviation such that: 



(2-5) 



t 



distance from origin 
a 



The above suggests that significant frequency functions 

Y = f (t) may be found among the solutions of (2-2) subject to 

2 

the following restrictions: 



(2-4) 

(2-5) 

( 2 - 6 ) 




f (t) dt = 1 



t f(t) dt = 0 



t^ f (t) dt = 1 



where: 



a 



n 



hi = f 



CO 



t^ f(t) dt 



(2-7) 
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and, 

(2-7a) 
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a 



3 







= 



M-4 




See Chapter III for definition of the moments \j^, ... 

Clearing (2-2) of fractions, multiplying through hy t^ and 

integrating over the range r to s (where r and s are the 

extremes of the range of variation for t) with respect to t 
■5 

gives: 

g 

(2-8) [a/t^Ydt-b Jt^dy-bJt'^+^dy-bJt^+2^y-/t^-^^Ydt ] =0 

° ^ t=r 



But through integration by parts: 

S S 

(2-8a) [/t^ dy ] = [t^ Y - n/t^~^ Y dt ] 

t=r t=r 

and if the frequency function, when multiplied by t^ vanishes 

ij. 

at the limits of the distribution, r and s, we have that: 

S „ 

(2-9) I t^ dy = -n N a , 

t=r ^ ^ 

which leads to the recursion formula for moments:-^ 



(2-9a) a+n b^ + (n+1) a^bj_ + (nrt-2) 

Giving n successively the values 0, 1, 2 ..., from (2-8) and 

(2-9) noting that N the total frequency cancels out and 

6 

letting = 1, = 0, = 1, one obtains: 
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a + + 

b^ + 3 b^ + 

(2-10) a + 5 b^ + 4 b^ + 

3^0 + + 5 tt2^ b^ + 

^ “ n -1 “ n +1 ^2 



a, a + 
3 



% a 



= 0 
= 1 
= 0^ 

= “4 

= a 

n+1 



Assuming that f(t) converges so rapidly that terms involving 
the third and higher powers of t may be neglected, a simul- 
taneous solution of (2-10) yields: 



" °3 

^ - 2 ( 1 + 26 }^ 

( 2 - 11 ) 

^ _ 2+6 
o 2(1+26)" 

7 

where, ' 

( 2 - 12 ) 6 



3 

^ 2 ( 1 + 26 ) 
^2 “ 2 ( 1 + 26 ) 
p 

2 - 5 - 6 
+ 3 



The value of a, where: 

(2-15) a = 2(1+26) 

represents the distance between the mean and the mode, which 
is defined by Pearson as the skewness of the distribution. 
Thus from the differential equation: 



dy _ a - t a - t 

(2-2) Y t + b^ t^ 

which has its basis in the theory of probability, the para- 
meters a, b^, b^, ■ and b^ have been determined in terms of 
the moments and expressed in terms of ou and 6. This 
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transformation is desirable to permit the use of the 6) 

chart for Identification of the appropriate type curve that 
fits a set of data. With the above parameters defined, 
Pearson's family of generalized frequency curves may be 
developed. 

In the family of curves there are three main types, 
nine transitional types which are special conditions of the 
three main types and the normal curve. 



Integration of the Differential Equation 

• p 

For 5 / 0, bg / 0, the denominator b^ + b^ t + b^ t Is 
a quadratic which can be written In the form bgCt - r^)(t - r^) 



Thus, 



(2-14) 1 Sz 



a - t 



Y dt b^ + b^ t + bg t bgCt - r^)(t - r^) 



Inhere: 



(2-15) r^^ = 






2 b. 



and r^ = 



- h - - 4b2b„ 



2 b. 



Upon substitution of and 5 In (2-15)# a, b^, b^, and 

- 45(5+2) - + ^/5 



(2-15a) r^^ 

where: 



2 6 



D = - 46(6+2) and 



2 6 



r_ = - a.. 
2 - 5 



- 46(6+2) - - y/P 



2 6 



2 6 



(2-15b) 
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Also, by the method of partial fractions. 
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(2-16) 



a - t 






1 B 

+ ] where 



bgCt-ri) (t-r^) bg t-r^ t-r^ 



(2-17) 



a-r. 



a-r^ 






and B = 



ri-r^ 



r^-ri 



Hence from (2-1^): 

(a-r^) dt 

f; Tt^ TE^ 



(2-18) 1 dy = ^ Trr-4-r -m-^r-r + ^ ^ 



Upon integration of (2-18): 



(2-19) Log Y 



Hence: 

( 2 - 20 ) 



l°s(t-rp + los(t-i-2)+l°e 0 



Y = C(t-r^) 



, a-r. a-r 

1-) L-( 2_\ 

2 12 ) 2 2 1 



( 2 - 21 ) 



1 a-r. 



1 a-r. 



Let ”2 = 



Thus (2-20) reduces to: 



( 2 - 22 ) 



m, m, 

Y = C(t-r3_) (t-rg) ^ 



Upon substitution of and 5 in (2-21) 



m. 



(2-23) 



1+6 ou 1+26 

1 = (- 5 -)(— ) - (— 5 -) and 



1+6 cu 1+26 

S' ■ 



For -^ < 6 < 0, the r ' s are real and opposite in sign; for 
6 > 0, and < 46(6+2), the r's are complex; and for 6 > 0 



r 
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and ^5(5+2) < the r's are real and of the same sign. 
These three conditions with the additional condition that 

0 establish the criteria for the three main types of 
Pearson's frequency functions designated Type I, IV, and 
VI. The boundaries of these areas, the curve (2+3^) = 

4(1+26)^ (2+6) which intersects the Type I and Type VT areas 
and the line 6 = - I/2 contains the points which correspond 
to the transitional t3rpes. The numbering system of the main 
types, i.e., I, IV, VI, is that established by Pearson and 
is used in this study to provide a standard reference to 
other literature on the family of curves. 

Analysis of Data 

For the purposes of analyzing the field data under 
consideration, it will be shown that Pearson's three main 
types of curves: Type I, Type IV, Type VT, plus the 

transitional Type III and the normal curve will suffice to 
describe the data. Therefore, the development of the other 
transitional types will not be described in detail. Only 
the equations and conditions are shoxm in Table III.^*^ On 
the following pages derivations are given for the five types 
of frequency distribution functions used in this thesis. 



X. Type I 

VJhen the r's in (2-14) are opposite in sign, (2-22) is 



written as: 



m, m^ 

Y = C(t-r^) (r^-t) 



(2-24) 



f 



i 

i 
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Equation (2-24) is called Type I of the Pearson system. The 
conditions on cu and 6 are: 

/- 0 , -1 < 6 < 0 (6 / - 1 / 2 ), (^ 36 ) ^ (^ 6 ) 

C is determined by setting the area i.e., the total probability 
equal to unity or, 

r m- m„ 

(2-25) C / ^ (t-r. ) (r -t) dt = 1 

^1 

Substituting: 

t-r 

(2-26) W = 

^2 ^1 

1 m, m, + m^ + 1 m^ 

(2-27) Of W dW = 1 

11 

Hence from the definition of the Beta function: 

m, + m^ +1 

(2-28) B(m^ + 1, ra^ + 1) = 1 

Thus C may be expressed either in terms of the Beta function, 

or alternately in terms of the Gamma function, through use 

12 

of an identity given by VJhitaker and Watson: 



(2-29) 



pl3 ^ 1 

m, +m„+l 

(rg-ri) p(m^+l,M2+l) 

1 r(m^+m2+2) 

m., +mQ+l 

(i’2"^l^ r(m^+l)r(m2+l) 



The other parameters are: 



22 



"*1 = 



- 0 _. + y/5 
2^ ^ 



-cu -V® 
^2 = 2 - 5 — 



m. 



= . (i^)(i .1, 



for QL^> 0 , < 0 < 3:’2 1^2l* 



”>o = - (^)(i + iz) 

vP 



^) .1 



The range of the curve is (r^, The curve will be U- 

shaped if both m's are < 0, J-shaped if the m's are oppos' 

l4 

in sign, and bell-shaped if both m's are > 0. 



Type rv 

Conditions: (X^ / 0, 6 > 0, ql^ < 4(6+2). The condi- 
tions imply that the r's in (2-14) are complex and therefore 
the second main type of curve. Type TV, may be determined. 
Thus (2-I5) can be written: 

-0^ + 



26 



= -r + iS where r 






and S = 






26 



■“5 

^ 



-iV^ 



2 6 



= -r -iS 



„ _ VI 

mi - 2 - m 



vjhere: 



m^ 2 — ^ 



V 



^ and M = i±|i 






Thus: 

(2-22) 



m, m„ 

Y = C(t-r^) (t-rg) becomes 



I 



I 
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Yl 

— m P 

(2-30) ^ c[(t+r)2 + S^] (^1^) 

and since: 

cl 

/-a-bi\ ^ „c tan”^ b/a ^c(— - tan"^ aA) 
'^a+bT^ - ® - e d 



the frequency function can be witten: 



(2-31) Y = C[(t+r)^ 4- S^] e ^"S“^ e ^ 

To determine C on setting the area of the curve over the 
interval (-oo, <»), equal to unity, Cralg^'^ shov/s that: 



(2-32) 

v/here: 

(2-33) 



,13 



,2m-l 



G( 2 m- 2 , V ) 



G(2m -2, V ) = / sin^^^“^^ 0 e ^ ^ 



d 0 



The term 0 is defined as: 
(2-33a) 



t+r 



0 = (| - tan"^ S ) 



The function G(2m -2, v ) is obtainable in tabular form from 
1 8 

Pearson. 

For this thesis, the function G(2m -2, v ) was generated 

by a special numerical integration procedure involving 

iq 

Gaussian coefficients. 



Type VI 

Conditions: / 0, 6 > 0, > 46(6+2). The 

conditions imply that the r's in (2-14) are real and of the 



I 



I 



same sign, thus the third, main tjrpe of curve. Type VI, is 
obtained. 



(2-22) 


m, m^ 

Y = C(t-r^) (t-r^) ^ 



An alternative simplified form is: 

m, 

(2-3^-) Y = C(Z (Z - a) 

17 

Craig shows that: 



(2-35) C^5 = 


r(-m 2 ) 

(m,+m^+l) 

,r(m^+l) r(-m 2 -m^ -l) a 



vjhere: 



( 2 - 36 ) 


Z = t-r^ 


(2-37) 


P 

(1 

H 

1 

ro 



Range of curve is (r^, co). 





Normal Tsrpe Curve 


Conditions: 


=5 =0. The original differential 


equation: 




(2-2) 


1 dy a - t 

Y dt ^0 ^1 ^2 



reduces to: 



(2-38) 


i ^ = -t 

'Y dt ^ 



when: 



=5=0, since from equation (2-11) 



( 



I 



I 

I 

I 



I 
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a, bT , b„ = 0 and b^ = 1 
^1-^2 o 



Hence: 

(2-38) 

Upon integration: 
(2-59) 

(2-40) 

Vfhere: 



^ = -t dt 



Log Y = - ^ + log C 



- t‘ 



Y = C e 2 



,20 N 



o 



where N = total number of observations. 



T3rpe III 

Conditions: / 0, 5=0. From equation (2-11) for 



6 = 0 : 



2 






= I = 1 



bg = 0 



Therefore the differential equation: 



( 2 - 2 ) 

becomes: 

(2-41) 



i ^ = 



a - t 



Y dt b^ + b^ t + b^ t' 






Y dt 



1 + 



1 + ^ t 



Which yields after Integration: 



21 



I 




5'iT^" 



m 

i 



I 



I 



-1 
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- 2 



( 2 -^ 2 ) 



ou *ou' 
Y = C(1 + ^ t) ^ 



“5 



Let A = 






and: 

( 2 - 43 ) 

\>;here: 



. A^-1 -At 

Y = C i(A+t) e 



0 1 = Ci 



_16 



and Craig shows, that: 



( 2 - 44 ) 



13 A 



A^ 2 



Since it is the purpose of this thesis to utilize 
Pearson's frequency curves in fitting the field data under 
consideration, rather than to develop the mathematical 
theory upon which the curves are based, which is a complete 
thesis in itself, only limited discussion of the development 
of the curves has been presented. For a more complete 
coverage of the development of these curves the following 
sources may be consulted. The foregoing discussion and 
derivations represent a synopsis of information given in 
the follov7ing references: 

a. Annals of Mathematical Statistics , Volume VII, 1956, "A 
New Exposition and Chart for the Pearson System of 
Frequency Curves" by Cecil C. Craig. 

b. Frequency Curves and Correlation, by ¥. P. Elderton, C. 



Cambridge, 1958 • 



] 
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c. "Karl Pearson's System of Generalized Frequency Curves", 
by Arnold M. Wedel, Thesis, University of Kansas, 19^1-8. 

d. Handbook of Mathematical Statistics , "Frequency Curves", 
by H. C. Carver, pp. 92-119, 1924. 

The entire family of curves is shown in Table III. The 
main types are shovm followed in order by the transitional 
types associated with a particular main type. This study 
has been limited to the use of the three main types, the 
transitional Type III, and the Normal Curve, These five 
curves adequately describe the field data used for this study. 

(g^^, 5) Chart 

In the course of the preceeding discussion a set of 
conditions for the various types of functions has been 
established in terms of and 6, parameters which may be 
readily calculated. The numerical values of these two 
parameters determine the Pearson curve appropriate to a 
particular distribution. The conditions for each type of 
curve are summarized in Table III. 

An (g^ , 6) chart which gives visual presentation of 
these conditions and an automatic means for type identifica- 
tion is relatively easy to construct. 

In addition to the lines, 6 = -1, 6 = - l/2, 6 = 0, 6 
= 2 / 5 , and g^ = 0, the chart contains only the curve: 

( 2 - 45 ) g^^ = 46(6+2) 




I 

I 



I 

i 

i 

I 
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TYPE 


EQUATION 1 


CONDITIONS 1 


REMARI^ 


!• 




- !<-S *T 

'i-f 3 sjc^y *• ^ 


Limited range, skew, 
usually bell-shaped 
but may be U-ahaped 
;or J-shaped* Range 
is (r-, , r?,) • 


II 


y ■ c 


0^, = 0 


Limited range, sym- 
metricaljusually bell- 
shaped but U-shaped 
v/hen 1 * 8 , Range 

is (-3, a) • 


III 


/ ffir 




Unlimited range in 
one direction, skew; 
bell-shaped, but may 
bo J-shaped. Range is 
( "“A , 0® )/ • 


VIII 




<✓3 (5 

s<--^ 

Cf+ y- SJ 


J-ahaped. Range from 
Infinite ordinate at 
ri to finite ordinate 
at t = rs. 


IX 




c 33 d :0 

~i 


Range from t = r» to 
infinite ordinate at 
r^* J-shaped* 


X 


y . 


S=a 


J-shaped with range 

(—1, 00 ) 0 


XII 






J-shaped with range 
(ri, rsL 


IV. 


y- x'^A ^ 


, oCs-y^ 0 


Unlimited range, 
skew, bell-shaped. 


V 


y-= C 


! *^3 

v^Csi--*-) 


Unlimited range in 
one direction, bell- 
shaped. Range isC-r,,*?] 


VII 


/.- c( 7 t'yA^j'" 


ef^s 0 

- if ? o_ _ . 


Unlimited range, sym- 
metrical, bell-shaped* 


Nor** 

mal 




S ~ ^ 


Unlimited range, sym- 
metrical, be 11-shaped* 


VI 




<r > <0 
>*f S' rs-ft~) 


Unlimited range in one^ 
direction, skew, bell- 
shaped but may bo J— 
shaped. 


XI 


y. c(j:.A^y^^ , 


y 0 

6 <-S ^ 


J-chaped with finite 
ordinate at t = rr. 
Range is (r^, <w? I. 



TABI£ III Summary of Pearson System of Frequency Curves 
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on v;hich the points corresponding to the Tsrpe V function lie, 
and the curve: 

(2-46) (2 + 35) = 4(1 + 26)^ (2 + 6) 

on which the points corresponding to the distribution 

22 

functions of Type VII, IX, X, and XI are found. 

p 

Construction of (o^ , 6) Chart 

2 

Point =. 0, 6 = 0 satisfies the conditions for the 

normal curve and is the starting point for constructing the 
graph. The lines 6 = -1, 6 = - 1/2, 

= 0, are easily constructed. 

For the equation: 

(2-45) = 46(6+2) 



For the equation: 

(2-46) (2+36) = 4(1+26)^ (2+6) 



Note when 6 = - 2/5, the expression (2+36) = 0, therefore 
the line for this equation approaches - 2/5 asymtopically. 



6=0, 6 = 2 / 5 , and 



6 




0. 


0. 


.1 


.84 


. 2 


1.76 


.3 


2.73 


.4 


3.84 



6 




0. 


4. 


. 2 


6.63' 


.4 


9.72 


-. 2 


1.85 


-.4 


.32 


-.5 


0. 


-.6 


1.12 
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Figure 2 presents visually the family of Pearson's 

p 

curves in terms of and 6. It is a simple matter to 

determine the type of curve that fits any set of data 

2 

merely by determining the ct^ and 6 values for the data and 
then entering the chart v/ith these values. 

The subscript, B, on the chart refers to bell-shaped 
curves, the subscript, J, refers to J-shaped curves, and 
the subscript, U, to U-shaped curves. 

The points for 6 < -1, correspond to no frequency 
functions, they fall in the "Impossible Area. " 

Pearson designated as heterotypic those members of his 
system for which the eighth movement failed to exist. (In 
such a case the standard deviation of the fourth moment in 
samples would be infinite. ) This area is established by 
5 ^ A. 




i 

i 

I 
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Impossible Area 
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FIGURE 2 CHART OP PEARSON'S CURVES IN TERMS OF 6) 
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CHAPTER III 

APPLICATION OP PEARSON CURVES 

Introdiiction 

The brief theoretical discussion of the Pearson 
frequency curves having been presented it is now useful to 
examine hov; these curves may be applied to the field data 
under consideration. 

Prom Table I there are eighteen sets of data available 
for the five fields. The Pearson system of frequency 
curves v/ill be applied to find a good theoretical fit for 
each given observed distribution. 

I'fnat is the value of frequency curves? A normal curve 
fitted to a given set of data is to determine ^^^hether or 
not the data are normally distributed. If the distribution 
is normal then use may be made of the extensive body of 
sampling theory applicable to normal populations. Compara- 
tively little is knom concerning sampling from non-normally 
distributed populations. 

T'Jhen data are distinguished as non-normal, there may 
be further advantages in fitting a non-normal curve to the 
data. Such a curve may serve to smooth the histogram and may 
thus permit a more accurate determination of the relative 
frequencies of the population from v/hich the sample v/as 
taken. The identification of the distribution of the given 
data with a particular frequency curve may also serve to 
distinguish them from other data, the distribution of \fnich 
is identified with a different frequency curve. These are 



1 

i 

I 

I 
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the t\'70 principle reasons for fitting non-normal frequency 
curves. Also, if a particular type of non-normal distri- 
bution occurs X'7ith sufficient frequency, this fact will 
serve as an incentive for the creation of the appropriate 
sampling theory. 

Fitting the Data 

The arithmetical labor involved in fitting a set of 
observed data to a frequency curve is lengthy and tedious. 

This may be a prime reason v;hy there has not been greater 
utilization of theoretical curves in practice to obtain a 
description of the distribution of a given set of data. 

With the advent of high speed digital computers this 
objection to the heavy arithmetical work involved is lessened. 

To illustrate the procedure involved in fitting a set 
of data to a Pearson frequency curve, the step by step 
calculations for one set of data, that of permeability for 
Field is shovm. The calculations for fitting the remain- 
ing sets of data vrere performed on a digital computer. 

The computer programs, x-nrltten in Fortran language, with 
accompanying flow charts are shovm in Appendix B. For each 
data set, the histogram and plotted curve follov;s the 
numerical calculations for that set. Tne normal curve has 
also been fitted to each set of data to illustrate the 

i 

comparison vriLth the non-normal curve that describes the 
data. In many cases the goodn.ess of fit test for the normal 
curve indicates that the normal curve gives a poor fit for 




I 

I 

I 

I 




the data^ v;hlch is as one would expect for some of the more 
markedly skewed distributions. In a fev; cases though, the 
normal curve does give a satisfactory fit to the data, 
even though the normal fit is not as good as the frequency 
curve describing the data. 

Pr'ocedure 

The procedure used in fitting the various types of 
curves is: 

1. Arrange the data in an array. Tabulate the data using 
convenient class intervals. 

2. Calculate the first four moments about a convenient 
vertical. 

3 . Transfer the moments to the centroid vertical or 
vertical through the mean. 

4. Apply Sheppard's corrections^ to the moments if there 

is high contact at both ends of the curve. 

2 

5 . Calculate and 6. 

6. Locate the mean X and the mode jl. 

7 . Determine by use of the (o^ , 6) chart what type of 
curve should be used. 

8. Calculate the constants for the equation of the curve. 

9 . Calculate the theoretical frequencies at the mid-point 
of each class interval. 

10. Plot the histogram. 

11. Plot the theoretical curve constructing the mid-ordinates 
at the middle of each class Interval. 
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12. Calculate the area graduation for each interval. Test 
for goodness of fit of theory to observation. 

13 . Pit the normal curve to the data for comparison and 
test for goodness of fit of the normal curve. 

Sample Calculations 

The permeability data for Field 3 consists of 195 

observations obtained from seven core analyses from seven 

v/ells. These data were divided into l4 class intervals of 

10 millidarcies each. Table IV shows a convenient method 

of calculating the first four moments for the core data. 

Follov;ing Table IV are the calculations needed to fit the 

frequency curve to the data. 

The fitting of a set of data to a Pearson curve is 

based on the method of moments where the r moment around 

2 

an arbitrary origin is: 

00 

(3-1) = f dP(X) 

-CO 

The zero-th moment about the origin, always exists and 

is equal to one. 

"bh. 

The r moment around the mean is: 

00 

( 3 - 2 ) U-r = f (X u,') dP(X) 

—00 

If the moments around an arbitrary origin are kno vrn, 
the follov/ing formula is used to find the moments about the 
3 

mean: 
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( 3 - 3 ) 





Thus: 




For ease of hand calculation and simplicity of notation, ' 
the scale for the independent variable of the frequency 
distribution, (i.e., porosity, permeability, or saturation) 
v;as transformed to a notation wherein the interval contain- 
ing the arbitrarily chosen mid-point is numbered zero and 
intervals on either side are numbered serially. The 
negative values were assigned to that side of the distribu- 
tion vjhich contained the mode. This convention is in keeping 
vriLth that adopted for the sign of "a" in the derivation of 
the Pearson system of frequenc3r curves. An additional 
transformation of the independent variable is made to correspond 
to standard statistical notation. This transformation is 
the introduction of the standard unit t. Thus vjrhen the 
histogram intervals are numbered serially. 



( 3 - 4 ) 



t 



a 



( 



I' 



1 




where takes on the values 1 , 2 , etc. at Interval 

mid-points. 

The equation relating t and, the original physical 
property, q, is: 



v/here q is the mean value of q and Aq is the Interval of q 
used to construct the histogram on the q scale. 

The following calculations starting with Table IV are 
for the permeability observations for Field 3. For each 
of the other data sets only the resulting constants and 
equations are shorn follov/ed by the graph of the frequency 
curve and normal curve. 

Taking moments about the arbitrary mid-ordinate ^5*0 
in terms of the transformed variable x with its correspond 
ing mid-point, x = 0, the following moments are calculated 



(3-5) 



q = t a Aq + q 




(3-8) 



(3-7) 





.4 



TABLE IV 

CALCULATION OP THE FIRST FOUR MOMENTS OP THE DISTRIBUTION OP PERMEABILITY- -FI ELD 5 
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And, for moments about the mean. 



= 0 

lig = Hg' - \ x ^'^ = 6.4615585 - (-.46155846)^ 

JJ.2 = 6.2485208 

= 4.9250769 - 5 (-.■^6155846) (6. 461558) + 2(-. 46l5584<6) 
= 15.675191 

2 

= U-4 ' - 4n^ ' ' -f 6ij.^ ' iig ' - 5ix^ ' 

\ xi ^ = I48.4lll6 

And the standard deviation for grouped data, 

(5-9) cJ = Vi ^ =^/6.1651«75 = 2.4997055 



Hence Sheppard's corrections are: 



( 5 - 10 ) 



( 5 - 11 ) 

( 5 - 12 ) 



Thus, 



— 1^0 ~ 



12 



= 6.2485208 - .085555 



Xg = 6.1651875 



x^ = = 15.675191 



x„ = 



1 7 

*^4 " 2 *^2 240 



X2^ = 148.41116 - |(6. 2485208) + .029167 
x^, = 145.51607 



X,' 



. ^ .79781261 



x 7 (6.1651875)^ 



I 

1 








40 



= 




145.31607 145.31607 

(0^1551^75 - 3«. 0095354 

2 a; - 3 cu^ - 6 
5 = ± 2 



+ 3 



3.8231476 



2(3.8231476) -3 (.79781261) - 6 
( 3 . 8231476 ) -I- 3 



6 = -.10950116 

D = - 45(6+2) 



D = .79781261 -4 (-.10950116) (1.8904988) 

D = 1.6258600 

Then entering the (o^ , 6) chart with: 

= .79781261 and 6 = -.10950116 

3 

it is observed that the frequency distribution of the 
permeability for Field 3 5e fitted by a Pearson Type 
curve. 

The equation of the curve is: 

m, 

Y = C(t-r3_) (rg-t) ^ 

where: 

N r(m, + m„ + 2) 

n _ 

ra,4-m„+l 

(r^-r^) r(m^+m2) r(m2+l) 

where N is the sum of the frequency for all class intervals 



r 



1 



■0^ +^/g 
2 6 



and: 



I 



I 




■f 




"*1 = 



89320367 4-Vl. b25ijb0 
2 (-.10950116) 



= -1.7^37633 

-OL. - yo 



r = 2 

^2 



2 5 



r>2 = 9.9007883 






m. 



= -(^) (1 - -1 



mi = 



- ( 



1 - .10950116 
-.10950116 



)(1 



.89320367 
2750919 



) - 1 



m^ = 1.435619^ 



- 1 



And: 

C 



m^ = 12.829027 



195 r(l.^356l9^) + 12.829027 + 2) 

(9.9007883 + 1.7^37633)^^*^^'^^^^ n^. 264646 ri3. 829027 



C = 



195 ri6. 2646464 . 

(11.6445516)^^*^'^ ri4. 264646 ri3. 829027 



C = 2.211844 X 10“^^ 

4 

The Gamma function, defined as: 



( 3 - 12 ) 



r(x) = / e"^ dt, X > 



was calculated in two v;ays, depending upon the value of x. 



42 

For values of x less than 9.0, the Gamma function X'/as obtained 
from the recursion formula: ^ 

D-13) r(x+l) = X r(x) ■ 

X'/hich Is useful for expressing F(x) as a function of some 
value of r(a) X'/here (a) is a value betx'/een 1. and 2. The 
value of r(a) v/as. determined by a table look-up and 
interpolation using tabulated Gamma functions.^ 

For values of x above 9-0, F(x) x^;as computed from the 
Stirling formula"^ for log F(x). 

log r(x) = 45429448 (x) + (x -.5) l^Sio ^ 

(5-14) 

+ log^Q (1 + + .39908993 

9 

The equation of the curve is: 

Y = (2.211844x10"^^) (t+1. 7437633 )^*^^^^^^^ (9.90079-t)^^'®^^ 

The range of the curve can be computed from the above 
equation. 

The range of the. carve is defined as the upper and 
lov;er values of the independent variable betx^een x-7hich all 
positive frequencies exist. These theoretical limits are 
determined by seeking the values of t for v;hlch Y = 0. By 
Inspection of the above equation, Y is zero x^hen t is -1.7437633 
and + 9 . 90070 . These values of t correspond to permeabilities 
of - 2.65 md and 288.55 respectively as determined, from the 
equation. 



q=q+t aAq 



I 
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I'/here t is one of the limiting values given above, a the 
standard deviation in units of x, A q is the class Interval 
in units of q, and q is the mean value of q. Because negative 
values- of permeability are not physically meaningful, the 
limits can be looked upon as zero and 288.55 millidarcys. 

The existence of the negative lower limit should present 
no more interpretational difficulties than those presented 
by the Imovm limits of any normal frequency curve which are 
by definition, -«> and -r«>. 



For calculation of graduation (mid-ordinates), the 
follov;ing arrangement is convenient: 



(1) 


(2) 


( 3 ) 


( 4 ) 


( 5 ) 


(6) 


Mid-point 


Distance from 
origin 


=. (|) 


( 3 ) - 


log( 4 ) 




65 


2. 46155846 


.98473219 


2.7284955 


. 43592 


8.9160561 



(7) 


( 8 ) 


( 9 ) 


( 10 ) 


( 11 ) 


( 12 ) 

Area of 


Log( 6 ) 


m^x(5) 


mgx(7) 


( 3 )-h ( 9 )-FLogC =LogY 

A 




Interval 


. 95017 


.624807 


12.18973 


1.16038 


14.467 


14.512 



The area of the interval is found by means of Simpson’s 
Rule:® 

1 ■ 

(5-15) I = 1/6(Y^ + /2 + ^1^ 

o ' 



By use of the above calculating techniques, the values 
of Y,. (Graduation-mid-ordinates) and Graduation (Areas) v;as 
found for each interval. Additionally the values of 



I 



1 

I 

I 

I 
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(Graduation-mid-ordinates) v;as found for each raid-point by 
substituting the appropriate value of t for each mid-point 
into the equation: _ ^2 



v;here: 



Thus: 



C 



N 



_N 

a 



^65 



= 195 

a 



-(. 9847 )^ 

2 



= 19.164 



The follov;ing summary for Field 3 permeability data is 
presented: Type Curve 

Y = ( 2.21184 X 10 "^^) (t 1.7437633)^*'^^^^^^^ ( 9 . 90079 -t)^“*®^^ 



Permeability „ Graduation Graduation Normal Curve 

Midpoint (md) Y (mid-ordinates) (Areas) (mid-ordinates) 
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14 . 48 


11. 43 


15 


29 


29.14 


28.72 


18. 53 


25 


34 


33.85 


33.57 


25.75 


35 


32 


31.81 


31. 68 


30. 4 l 


45 


24 


26.47 


26.44 


30.60 


55 


20 


20. 23 
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26. 23 


65 


16 


14.47 


14.51 


19.16 


75 


11 


9.77 


9.82 


11.93 


85 


n 
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/ 

ro 


6.31 


6.33 
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3.82 
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2.85 
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2. 22 


2 . 24 


1.10 


115 
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1. 22 


1 . 24 


.36 


125 
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.64 


.65 


.10 


135 


1 





112 


.02 




195 


194.95 


194.09 


184.86 
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Goodness of Fit 



p 

The Chi Square (X ) goodness of fit test for the above 
curve may be calculated by the folloxiri.ng method: 

Chi-Square "Goodness of Pit" 



Graduated f - p (f - P)^ (f - 



15 


14.5 


.5 


.25 


.018 

• 


29 


28.7 


.3 


.09 


.003 


34 


33.6 


.4 


.16 


.006 


32 


31.7 


.3 


.09 


• 

0 

0 


2 ^ 


26.4 


- 2.4 


5.76 


. 225 


20 


20.3 


- .3 


.09 


0 

0 


16 


14.5 


1.5 


2.25 


.152 


11 


9.8 


1.2 


1.44 


.142 


8 


6.3 


1.7 


2.89 


.444 


2 


3.8 


-1.9 


3.61 


.893 


1 


2.2 


- 1.2 


1.44 


.689 


1 


1 . 2 


- .2 


.04 


.046 


1 


.7 


.3 


.09 


.188 


1 


il 

194 . 1 


.7 


.21 


.700 

3.522 



The number of degrees of freedom for this curve is n - 
1 = 13, where n is the number of class Intervals. Entering 
a X table^ v/ith I3 degrees of freedom and X = 3*522 an 

approximate value of p = .99 is found. This indicates 

that there are 99 chances out of 100 that differences as 

large as those found could have arisen due to chance or 

sampling variation. Therefore, we can conclude that the 

Type Ig gives an exceptionally good fit to the pemeability 

data. Por testing the goodness of fit the significance 

level of p is usually taken as either .05 or . 01 .^*^ 
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FIGURE 3 TYPE PERMEABILITY FIELD 
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Numerical Results 
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Permeability (K) — Field 1 

Data Range (.2, 459 md. ) Mean = 65.26 md. 

Class interval = 40 md. a = 6O.8 md. 

0^2 ^ 27404 6 = -.2054506 Type Ij curve 

r^ = -.95047735 = 9.4901755 

m^ = -.5008110 m^ = 6.1512056 

C = 2.8551007 X 10"^ 

Y = (2. 8551007x10"'*^) (t+. 95047755)"*^°^^^^^^ (9.4901755-4)^'^^^^°^^ 

Curve Range ( 6 . 7 > 490 md. ) 

Ij Curve Normal Curve 

= 17. 48 = 968.2 

Fit is good at .05 level Fit is not good 



Permeability 

Midpoint Frequency 
(md. ) 



Graduation Graduation 
(mid-ordinate) (Areas) 



Normal Curve 
(mid-ordinate) 
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Porosity (0) — Field 1 
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Data Range (5*4, 25*5^) 



Class interval = 2.5^ 
a ^ = .72488482 

p 

r^ = -1.3630479 



Mean = 19 . 43^ 

a = 3.65 

5 = -.29574565 Type Ig curve 



r^ = 4. 2614796 
nig, = 2.6432928 



= .16531900 
0 = 3.7594063 

Y= (3. 7594063) (t+1. 3630479)^* ( 4. 26l4796-t)^'^^^^^^® 

Curve Range (3.83, 24.0^) 

Ig Curve Normal Curve 

= 19.841246 = 160.8 

Fit is good at .02 level Fit is not good 



Graduation Graduation Normal Curve 



Porosity 

Midpoint Frequency (jni(j_Q3^(3^j_nate) ^(Areas) (mid-ordinate) 
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Oil Saturation (S^j — Field 1 



Data Range (7.0, 75.0^) Mean = 58 . 89 ^ 

Class interval =5^ = 11 . 05 

= .020572580 6 = .027215^51 Normal Type curve 

C = 1^2.75 

_ £ 

Y = 142.75 e ^ 



Normal 

= 16.568650 



Curve Range (- 00 , 00 ) 



Fit is good at . 25 level 



Midpoint m Frequency 



Normal 

Mid-ordinates 
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FIGURE 6 NORML OIL SATURATION FIELD 1 
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Water Saturation (S ) — Field 1 

Data Range ( 15 ^ 89 ^) Mean = KO.^Gfo 

Class interval =5^ cr = 12.15 

= .72575019 5 = -.10356269 Type curve 

r^ = -1.8241159 rg = IO.038806 

= 1.6619958 nig = 13.649978 

C = 9.5915501 X 10 "^^ 

Y = (9.5915501x10"^^) (t+1. 8241159)^*^^^^^^^ (l0.938806-t)^^-^^^^^® 

Curve Range (l8.9^ 171) 

Ig Curve Normal Curve 

= 10.875972 = 95.56815 

Fit is good at .80 level Fit is not good 



_ 

w _ Graduation Graduation Normal Curve 

Midpoint {%) Y Mid-ordinates (Areas) Mid-ordinates 
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Pemeability (K) --Field 2 
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Data Range (.l 6 , J>ol.O md. ) Mean = 49.8 md. 

Class interval = 30 md. a = 53.4 md. 

o,^ = 5*4370534 6 = -.22554678 Type Ij curve 

r^ = -.71192903 r^ = 11.050699 

m^ = -.58435670 m^ = 5.4516965 

C = 1.8509951 X 10 "^ 

Y = (1.8509951x10"^^) (t+. 71192903) (ll.050699-t)5-'^5l6965 

Curve Range ( 11 . 9 ^ 636.8 md. ) 

Ij Curve Normal Curve 

= 24.96777 = 486.497 

Pit is good at . 02 level Pit is not good 



Permeability Graduation Graduation Normal Curve 

Mid-point (md. ) 9 7 (mid -ordinates) .(Areas) (mid-ordinates) 
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FIGURE 8 TYPE Ij PERMEABILITY FIELD 2 
RANGE (11.9, 656 . 8 ) 
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Porosity ( 9 ) — Field 2 
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Data Range (8.1, 24.8) Mean = 19-72^ 

Class interval = 2 fi a = 2 . 54 ^ 

cu^ = 1.5855587 <5 = .14055615 Type VIg curve 

= 4. 5488552 = -22,778045 

r^ = -1.9693580 r^ = -7.7511868 

c = 20.968745 

Y = (20.968745) (t+7. 75 11863 (t+l. 9698580)^* 5^^^'^552 

Curve Range (-«>, 24.55) 

VIg Curve Normal Curve 

= 15.456512 = 71.647549 

Fit is good at .05 level Fit is not good 



Porosity 
Midpoint (^) 



Frequency 



Graduation Graduation Normal Curve 
(mid-ordinates) Areas (mid-ordinates) 
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FIGURE 9 TYPE VI„ POROSITY FIELD 2 
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Oil Saturation (S^) — Field 2 



= .6911815 



Data Ranee (9, 79^0 Mean = 49-58/^ 

Class interval =7^ cr = 14.7 

5 = -.50506016 Type Ig curve 

r^ = -1. 5800193 
= .15792540 
c = 1.7498845 

Y = (1. 7498845) (t+1. 5600193)^'^^^^^^ (4.0852955-t)^''^^^^^^^ 

Curve Range (-10. 7^ 69.4) 



r^ = 4.0852955 
= 2.4181582 



Ig Curve 
= 21.73 7241 



Pit is good at . 01 level 



Normal Curve 

')& = 90.32 

Fit is not good 
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Midpoint 



Gi’aduation Graduation Normal 
(^) Mid-ordinate Areas (mid-ordinates) 
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Water Saturation (S ) --Field 2 

Data Range (ll, 85^) Mean =58.65^ 

Class interval =6fo a = 14.82 . 

= 1.0579891 5 = -.52958992 Type Ij curve 

r^ = -1.1787589 Tg = 4.2995634 

= -.12466560 = 2.1928124 

c = 1.8860123 

Y = (1.8860123) (t+1. 1787589)"' ( 4 . 2995634-t)^*^^^^^^^ 

Curve Range ( 21 . 2 , 102 . 3 ) 

Ij Curve Normal Curve 

X“ = 7.5586264 = 98.996255 

Pit is good at .75 level Pit is not good 
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FIGURE 11 TYPE I, WATER SATURATION FIELD 2 
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Porosity (0) — Field 3 



63 



Data Range (10.2, 23.1^) Mean = 18.06^ 



Class interval = 1.5^ 


0 = 2. 46 


= .58195^15 6 


= -.25091835 Type Ig curve 


r^ = -1.526424 


r^ = 4.5666935 


= .4957617 


= 3.4749591 



c = .115555^1 

Y = (.115555^1) (t+1. 526^.^24) ( 4 . 566693 

Curve Range ( 6 . 7 j 21.8) 



I„ Curve 

IJ 

= 17.752871 



Normal Curve 

= 36.031^58 



Fit is good at .05 level 



Fit is not good 



Porosity „ Graduation Graduation Normal 

Midpoint (^) Y Mid-ordinates Areas (mid-ordinates) 
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FIGURE 12 TYPE POROSITY FIELD 3 
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Oil Saturation (S^) --Field 3 

Data Range (I3.I, 48.?^) Mean = 33.26^ 

Class Interval = 3/^ cr = 5. 7 

= .79^3^377 <5 = .072484741 Type VIg curve 

r^ = -4.2828677 Tg = -6.6759047 

= 51.961440 mg = -83.553454 

C = 37. 514620 

Y = (37. 514620) (t+6. 6759047) (t+4. 2828677)^^* 

Curve Range (-00, 57*7) 

VIg Curve Normal Curve 

= 10.864505 = 24.761836 

Pit is good at . 50 level Pit is good at .01 level 
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° n^n,^-\r Graduation Graduation Normal 

Midpoint (^) *1 ^ Mid-ordinates Areas (mid-ordinates) 
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water Saturation (S ) — Field 5 

VI 



Data Range 76 . 1 ) Mean = 45 . 35 ^ 

Class interval = 5 ^ a = 7.85 

0:^^ = 1.4631973 5 = -.03992642 • Type curve 

A = 1.6534022 C = 13.616174 

Y = C(A + t)^ 

Y = (13.616174) (1.6534022)^-^^^5° g- 1 . 6534022 t 

Curve Range ( 32 . 4 ^ «•) 



Illg Curve 

X- = 4.7735815 

Pit is good at .85 level 



Normal Curve 

= 16.889006 

Fit is good at .05 level 
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vj _ Graduation Graduation Normal 

Midpoint (5^) ^ ^ Mid-ordinates Areas Mid-ordinates 
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Porosity (0) — Field 4 
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Data Range (9.3, 22. 4^5) Mean = 17.8^ 

Class interval =1.5^ o = 2.0 

= 1.1315972 6 = .0047189 Type curve 

A = 1.8801133 C = 18.684424 

Y = C(A + t)-^^"^ 

Y = (18.684424) (1.8801133 +t)^' ^-1.880113 t 

Curve Range (-«>, 21.57) 

Illg Curve Normal Curve 

= 6.3034068 X^ = 19.69985 

Pit is good at . 60 Pit is good at . 02 



Porosity „ Graduation Graduation Normal 

Midpoint (^) ^ Mid-ordinates Areas Mid-ordinates 
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FIGURE 15 TYPE POROSITY FIELD 4 
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Oil Saturation (S^) — Field ^ 

Data Range (13-3, 63.2^) Mean = 39*96^ 

Class interval = 3.5/^ cr = 10.1 

a_^ = .01836461 6 = -.13178302 Type curve 

r^ = -3.2859450 rg = 4.3142724 

ni^ = 4.6968277 = 6.4796347 

C = 1.1959325 X 10"5 
Y= (1.19593 25x10"^) (t-i-3 . 235945)^* ( 4.3l42724-t)^* 

Curve Range (6.9^ 83.3) 

1-3 Curve Normal Curve 

JD 

= 13.992946 = 15.734882 

Pit is good at .60 level Fit is good at .40 level 
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o Trr><=>rmf=.-noi7- Graduation Graduation Normal 

Midpoint(^) 9 - J Mid-ordinates Areas (mid-ordinates) 
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FIGURE l6 TYPE OIL SATURATION FIELD 
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llateT Saturation (S ) --Field 4 

VJ 

Data Range (l8.6, 70.8^) Mean =59-81^ 

Class Interval = 6 fo a = 15.5 

= .44508784 5 = -.41374306 Type curve 

r^ = -1.3112949 r^ = 2.9237671 

= -.122539 1^2 = .9564568 

C = 10.194623 

Y = (10.194628) (t-j-1.3112949)“‘^^^^^^ (2.9237671-t)* 

Curve Range (22.1, 79-2) 

Ig Curve Normal Curve 

X“ = 15.^06190 = 28.03342 

Pit is good at .05 level Pit is not good 
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Graduation Graduation Normal 
Mid- ordinates Areas Mid -ordinates 
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FIGURE 17 TYPE Ij WATER SATURATION FIELD 4 

RANGE (22.1, 79.2) 



7 ^ 



CO 




5 

CO 



Porosity (9) --Field 5 
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Data Range { 3 -^, 3^.3 
Class interval = 3 /^ 

= .33849171 

r = 1.1205492 
V = -3.9842932 



fj ) Mean = 20 . 66 ^ 

a = 4.32 

6 = . 25980502 Type IV curve 

m = 5.8520056 
s = 2.7291720 



C = 3.3495724 X lo’^ 

Y = (3.3405724x10’^) [(t-M. 1205492)^ -h (2. 7291720) ^]~ 5*®520056 ^ 



^ 3 . 9842932 tan 



/t-i- 1 . 1205492 N 
'• 2.7291720 ^ 



Curve Range (-«>_, 00) 



IV Curve 

= 83.10963 

Pit is not good 



Normal Curve 

= 109.93934 

Pit is not good 



Porosity „ Graduation Graduation Normal 

Midpoint (^) equen y ordinates Areas Mid-ordinates 
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FIGURE 18 TYPE IV POROSITY FIELD 
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Oil Saturation (S^) — Field 5 



Data Range (5-7, 77. 
Class interval = 8 ^ 

= .00047516776 
= -3.9580728 

= 6 . 5279175 



Mean = 32 . 77 ^ 

0 = 13.84 

6 = -.11483628 Type curve 

rg = 4.1474939 
nig = 6.8881809 



C = 2.5711286 X 10"^ 

Y = ( 2 . 5711286 x 10 "°) (t+ 3 . 9580728 )^* (4. l474939-t)°'®^^^®°^ 

Curve Range (-21. 9 j 90.1) 



Ig Curve 

= 16.2566686 



Normal Curve 

= 17.879294 



Pit is good at .05 level 



Pit is good at .04 level 



“"o Graduation Graduation Normal 

Midpoint Mid-ordinates Areas Mid-ordinates 
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FIGURE 19 TYPE OIL SATURATION PIEI.D 
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Water Saturation (S ) — Field 5 

V7 , 



Data Range (12.4, 92.3fi) Mean 52.17 

Class interval 8^ a = l6,0^ 

a_- = .-000611895 6 = -.02569028 

c = 2^4.82 

_ 

Y = 2^1-4.82 e. ^ 

Curve Range {-<x>, w) 



Normal 

= 1^.559695 

Fit is good at . 20 level 



Normal Type curve 



w . Normal 

Midpoint (^) i-requency Mid-ordinates 
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water Saturation (S,J — Field 5 

^ VI' . 



Data Range (12.4, 92 . 3 ^'^) Mean 52.17 

Class interval 8^ a = 16.0;^ 

a_“ = ; 000611895 6 - -.02569028 

c = 244.82 

_ 

Y = 244.82 e. ^ 

Curve Range (-<», co) 



Normal 

X“ = 14.559695 

Fit is good at . 20 level 



Normal Type curve 
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FIGURE 20 NORMAL WATER SATURATION FIELD 5 
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Sx^uninary 

A concise and easily visualized summary of the distri- 

p 

butions of the data may be shorn by means of the (o^ , 6) 

p 

chart. Figure 2 is the complete (ct^ , 6) chart; hoxi?ever, 
for purposes of shov/ing the distributions of the properties 
under consideration, only the necessary portion of the chart 
has been prepared for each of the field properties. 

Figure 21 shov;s the distributions for porosity for the 
five fields. There appears to be no pattern to the types 
of distributions, in that Fields 1 and 5 are of the Type Ig, 
Field 4 is a Type Field 2 is a Type VIg and Field 5 

is a Type- IV. 

«=< * 




FIGURE 21 (o^^; 6) Chart of Porosity (Normal Curve is Pt. 0.0) 
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Of the five fields, only Field 4 satisfied the goodness 
of fit test for the normal curve. This is somevj’hat in 
contrast to the conclusion of Jan Law^^ that: 

"With some exceptions permeability and porosity 
assemblages give respectively satisfactory logarithmic 
and arithmetic normal frequency distributions. " 

For the present study it was the exception where 
porosity data gave a satisfactory arithmetic normal frequency 
distribution. 

The (a^ , 6) chart for the data. Figure 22, indicates 
a somewhat different situation regarding normal distributions. ' 
Four of the five fields, 1, 5, 4, 5 , satisfied the goodness 
of fit test for normal distributions. 

0 I 2 




FIGURE 22 6) Chart of 



All five fields for S,, v;ere fitted by a Type I curve 

w 

2 

v/ith varying values of and 6. Field 5 satisfied the 
goodness of fit test for the normal curve. 




FIGURE 23 6) Chart of 

It is noted that both the and S,, distributions for 

o w 

Field 5 which has a considerably greater nvimber of samples 

included, approximately. 1700, than any of the other fields 

satisfies the goodness of fit for the normal curve. This 

could indicate ‘that if large enough samples are talcen, the 

Sq and distributions approach normality. For sampling 

considerations, the assumption that and populations 

are normally distributed would permit many established 

techniques to be applied to the analysis of these properties 

A study was made of what effect the choice of interval 

2 

size v/ould have on the ou and 6 values. ' For each set of 
data various interval t\rldths were used,'i.e. , for porosity 



1 percent, I .5 percent, 2 percent, and 3 percent interval 

2 

v/ldths, to calculate the parameters and 6. For all 

fields with the exception of Field , no appreciable dif- 

2 

ference was noted In the values of and 6 for the various 

interval sizes tried. There were only 132 observations for 

2 

Field 3^ and the calculated values of and 6 were more 

2 

sensitive to the size of interval used, vri.th ranging 
from .641 to .279 and 6 ranging from -.249 to -.099 as the 
interval size was varied from 1 to 2.5 percent in steps of 
. 5 percent. 

It was concluded from this study that with a sufficiently 

large number of observations (> 200), a v/idth chosen to give 

between 7 to I 5 intervals will give consistent results in the 

2 

calculation of ct^ and 6. 

Although it \fas not possible to study enough fields to 

determine the relation between reservoir type and the 
2 

parameters and 6 herein measured, it would be reasonable 

to assvime that such a situation exists. Specifically the 
2 

parameters and 6 vrould be expected to reflect the 

variation of individual properties throughout a single 

depositional unit. They should also characterize the deposi- 

tional unit, distinguishing it from similar units in a geologic 

2 

basin or province. Xfithin this context, the variables 
and 6 could represent properties which could be correlated 
in the geologic sense for determining stratigraphic equality. 

If data were available in sufficient quantity to yield 

2 

<x^ and 6 parameters for Individual wells, then these 
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variables could be contoured for specific reservoirs or 
used to segment the reservoir into smaller more homogeneous 
units for mathematical analysis. The values themselves 
V70uld then provide the information necessary for preparation 
of a mathematical model of the reservoir including the 
effects of heterogeneity. 

Thus in reservoir models there is no "a priori" reason 

v/hy the Gaussian normal distribution function need be used 

to study the reservoir performance. If experimental evidence 

indicates the existence of a non-normal Pearson type 

distribution function for a particular property then this 

specific distribution function could be used. 

However before quantitative use is made of the 
2 

statistics and 6, it should be emphasized that these 

apply only to the set of data from which they were extracted. 

2 

They are only estimates of the parameters and 6^ inhere 

the subscript r denotes the value of parameters for the 

population of samples vjhich comprise the entire reservoir. 

For the normal distribution which is described by the mean 

and standard deviation, techniques are well known that 

employ these sample statistics, X and a, to estimate 

population parameters. However very little information is 

2 

available for estimating population parameters and 6^ 

~ 2 

from the sample statistics ou and 6. 



1 



CHAPTER rV 

.ANALYSIS OP LOGRITHMS OP DATA 

Introduction 

Jan Lav/^ in his v;ork with core data concluded that 
"with some exceptions permeability. .. .assemblages give 
satisfactory logrlthmlc. . . .normal frequency distributions. " 

To investigate the applicability of Jan Law's conclusion to 
the core data available for the fields under study is the 
purpose of this chapter. 

It can be observed from Pigures 4 and 8 that the 
frequency distributions of the permeability of Pields 1 
and 2 are markedly skewed. The distributions are of the 
Pearson Type Ij. Does the distribution of the logrithms of 
these data approximate the normal curve? 

Logrlthmlc Distributions 

If the distributions of these data do satisfy a 

logrithmic normal frequency distribution, an accumulated 

frequency curve of the permeability data plotted on logrithmic 

2 

probability paper should follov; a straight line. 

The sample data may be cumulated and put in percentage 
form as in Tables V and VI. These ciunulative percentages 
may then be plotted on logrithmic probability paper. If 
the resulting curve is approximately a straight line, we may 
proceed with assurance to fit a normal curve to the logrithms 
of the data. 

Three cycle logrithmic probability paper was used to 
plot the cumulative percentages from Tables V and VI. 
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TABLE V Cumulative Distribution of Permeability for Field 1 



Permeability in md. Number of Measurements Percent of Total 



-less than 1.60 


150 


10 


2.89 


195 


15 


6. 25 


260 


20 


11.00 


325 


25 


16.00 


390 


30 


24.00 


455 


35 


50.00 


520 


4 o 


37.00 


585 


45 


45.00 


651 


50 


53.00 


715 


55 


65.00 


780 


60 


71.00 


845 


65 


84.00 


910 


70 


97.00 


975 


75 


111.00 


1040 


80 


126.50 


1105 


85 


151.50 


1170 


90 


185.00 


1235 


95 


459.00 


1303 


100 


TABLE VI Cumulative 


Distribution of 


Permeability for Field 2 


Permeability in md. 


Number of Measurements Percent of Total 



less than 



1.1 


31 


5 


1.8 


63 


10 


3.7 


. 94 


15 


7.0 


126 


20 


10.0 


158 


25 


15.0 


189 


30 


20.0 


220 


35 


24.9 


252 


40 


29.0 


285 


45 


32.0 


317 


50 


56.0 


546 


55 


42.0 


378 


60 


48.0 


^ 409 


65 


53.0 


441 


70 


65.0 


472 


75 


73.0 


504 


80 


90.0 


535 


85 


122.8 


572 


90 


175.0 


605 


95 


580.0 


635 


100 



The resultant curves obtained for the tv;o fields are 
shown in Figures 24 and 25 respectively. Prom visual 
observation it appears that the two curves do not approxi- 
mate straight lines. Therefore it may be concluded that 
the permeability data for these fields do not follow 
satisfactorily the logritlimic normal distribution. 

The question that now arises is if the logrlthms of the 
data are not normally distributed, v/hat distribution pattern 
do they follovj? To ansv;er this question, the permeability 
data v;ere converted to logrlthms. 

The logrlthms of the permeability of the two fields 
v;ere then treated as data to v;hich a Pearson frequency 
curve was fitted. The calculated results using the logrlthms 
are shown on the follox'/ing pages, vrith the calculations and 
figures for Fields 1 and 2 respectively shown. 

Pitting the Logrlthmlc Data to Pearson Curves 

The procedure outlined in Chapter III was used to fit 
the logrlthms of the permeability of Fields 1 and 2 to a 
Pearson type curve. The curves obtained are shown in Figures 
26 and 27 and were found to be Type Ij and Lg. Additionally 
a normal curve fitted to the histogram is shown for compara- 
tive purposes. 

To simplify the interpretation of Figures 26 and 27 ^ the 
abscissa of the curves v/ere converted to an interval scale 
V7hich is denoted as a ir scale. Tables VII and VIII show the 
logrithm value and the millidarcy value of each interval of 



the abscissa. 
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FIGURE 25 PERMEABILITY MEASUREMENTS— FIELD 2 (On Logrithmlc Probability Paper) 
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Calculations log K — Field 1 



Data Range (-I. 3 OIO 5 , 2.64246) Mean^^^ = 1.4288151 
Class Interval = *50000 a = .74250 






= .86965292 



6 = -.47453376 



Type Ij Curve 



r^ = -1.0619523 

= -.42484190 



rg = 5.0271256 
nig = .65950490 



c = 7.5824862 

Y = 7. 5824862 (t+1. 0619525 )■* ( 5 . 0271256 -t)*^^^ 5 ‘^^^° 

Range^^g (-1.17581, 2.21681) 

Normal CTurve 



Ij Curve 

= 50.534208 



X^ = 428.58022 



Pit is good at .001 level 



Pit is not good 



7 T Scale Graduation Graduation Normal Curve 

Midpoint ^ Mid-ordinates Areas Mid-ordinates 
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FIGURE 26 FREQUENCY DISTRIBUTION OP LOG K 
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Calculations log K — Field 2 

Data Range^^g (-1.2040, 2. 5800 ) = 1.573706 

Class Intervals = .4000 a = .6920 

= .625648 6 = -.2240993 Type Ig Curve 

= 1.5577193 rg = 5.0873113 

= .62525930 mg = 4.5015559 

C = 8 . 999135 ^ X 10"^ 

Y = (8. 999135toO"^) (t+1. 5577193 )*^^^^^^^^ (5.0873713-t)^*^°^^^^^ 

Range^^g (-2.14621, 2.473790) 

Ig Curve Normal Curve 

= 24.989362 X^ = 108.47524 

Pit is good at .OO 5 level Pit is not good 



TT Scale -cu^ Graduation Graduation Normal Curve 

Mid-point ^ ^ Mid-ordinates Areas (mid-ordinates) 
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Summary 



Prom the cumulative frequency curves and the logrithmic 
frequency distribution curves it may be concluded that the 
permeability data under study does not satisfactorily fit a 
logrithmic normal curve. It is not the intent of this study 
to dispute Jan Lava's conclusions that the logrithms of permea 
bility data approximate a normal curve, but more to express 
a word of caution in indiscrimately using the logrithmic 
normal curve to analyze the permeability distribution of a 
particular set of data. The data used by Jan La\7 came from 
a different geographical area and from fields with different 
depositional properties than the data for this study. 
Therefore it appears that each and every set of data must 
be analyzed to determine its applicability to the logrithmic 
normal curve. 

Logrithms do provide a means to remove some skewness 
from a set of data, but not necessarily to convert it to a 
normal distribution. If sampling techniques were developed 
for Pearson's Type distributions, the conversion of 
permeability data to logrithms for analysis could serve a 
very useful purpose. At present though, this study indicates 
that caution must be exercised in placing too great of a 

degree of reliability on conclusions drawn from the analysis 

\ 

of the distributions of the logrithms of permeability data. 
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CHAPTER V 

ANALYSIS OP VARIANCE 



Introduction 

Prom each core analysis there may be readily obtained 
a sample mean (x) and a sample variance (S^ ) where: 

ZX 

(5-1) Vll = TT 



and, 

(5-2) 




Z(X„ - X)2 

n 



where X^ = individual measurements within each well sample 
n = number of samples. 

Within a field the various means obtained from the 
different wells most likely will not be numerically identical. 
It is desired to determine whether or not the difference 
between the various means, 3^, X^ .... X^ can be explained 
by random errors, that is, the means are statistically the 
same and represent identical populations, or whether the 
means are actually different where the differences do not 
result from sampling errors. 

To illustrate the techniques involved in making these 
determinations, the data obtained from the core analysis for 
porosity of Pield 2 will be used. Prom this data the follov/- 
ing sample means and sample variances were calculated for 
the wells cored: 



Well 


Number of 
Measurements 


Mean (X^) 




Variance (S^ 

0 


1 


24 


.199 




. 000198 


2 


28 


.189 




. 000202 


5: 


26 


.200 




. 000128 


ji 

1 


27 


.205 




.000255 


5 


18 


.199 




. 000265 


6 


36 


. 186 




. 000890 


7 


14 


.184 




. 000015 


8 


23 


. 208 




. 000174 


9 


34 


.191 




. 000607 


10 


20 


.190 




. 000228 


11 


31 


.205 




. 000402 


12 


19 


. 208 




. 000894 


13 


15 


.181 




. 000946 


1^ 




.212 




.000512 




345 


2.775 






Prom 


the above it 


can be observed 


that 


the means range 


from .181 


to .212. The 


! question which 


arises, is whether 


ordinary 


random sampling errors account 


; for 


the differences 


in these 


means ^ or may 


it be concluded 


that 


the means are 



different because of reasons other than sampling fluctuations? 
The F Test 

The P test devised by R. A. Pisher^ and named in his 

honor is a means for ans\i:ering the above question. The 

basis for this test lies in the availability of tvio independent 

estimates of the population variance. Consider a single 

core analysis ^^0 consisting of all of the porosity measure- 

2 

ments from Well 1. The variance that set is a 

measure of the Internal scatter of the measurements of the 






1 

\ 

1 



population. 



Similar statements apply to S 



2 




Ho^^rever, these variances are influenced "by sampling error 

and are not necessarily equal to the population variance. 

A better estimate of the internal scatter of the measurements 

may be obtained by pooling the individual -sample variances. 

The better estimate of the wi thin-sample or error variance 
2 

is given by: 

(5-3) Se^ = ^ 

(n^ + n^ + n^) - 



where n^ denotes the number of wells and n^, n^ .... 
denotes the number of measurements in wells V/1, W2 . .'. . VJk. 
The numerator of equation (5-3) Is called the "within- 

O 

groups Slim of the squares", and Se may be called the mean 
square of individual measurements. 

2 2 

The degrees of freedom associated with Se are: 

^1 

(5—^) f e = 2 (nk)^ - n^ 



v;here n^^ denotes the number of wells and (nk)^^ denotes the 
number of measurements in the 1th well, or fe = the total 
number of measurements in all wells less the number of v;ells. 

Another independent estimate of the population variance 
may be obtained from the sample means. The variance of the 

■5 '"■ 

population of means is:"'^ 



^1 



(5-5) 



m,p 






n^^ - 1 



100 



where: 

(5-6) 



= 



^W2 



X. 






Number of Wells 



However, the variance of the population of means Is not In 
Itself equal to the population variance. If all samples 
v;ere of the same size, the second estimate of the population 
variance would be:^ 



(5-7) 




n 



k 



S 



2 

m,p 



If the samples are of different size, ^2 ^ 

vihlch Is usually the case In core analysis, a pooled result 

3 

Is to be used as an average sample slze,"^ 



(5-8) 



n 



o 




1 




n 






n. o 

2 [(nj^)l] 

2^1 (nj^)i 



3 

Therefore: 

(5-9) = n^ m,p 

when S Is estimated by either equation (5-7) or (5-9) ^ 

2 

n^-1 degrees of freedom are Involved. Is often referred 

to as the mean square of sample means. 

If the random factors which give rise to the v/lthln- 

2 

sample or error variance Se are the only factors causing 

the differences between the sample means, the two Independent 

2 2 

variance estimates Se .and S should, except for sampling 

error, be equal. The probability of particular rations of 
2 2 

S to Se has been computed by Fisher. G?hls distribution 
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is the v;ell known F distribution^ which is a function of the 

P P 

degrees of freedom associated with both S and Se . There- 

P 

fore the ratio, ^ 

(5-10) 



P 

Se^ 



is a measure of v/hether or not random sampling error can 

account for the observed differences between sample means. 

By the use of the P table and the degrees of freedom associated 
2 2 

v;ith Sp and Se , a determination of whether random sampling 
errors could account for the differences between the means 
can be made. 

Conducting the P test on the porosity core analysis 
results for Field 2 gives: 



n. 



n. 



(5-5) 



Se2 = 



2 1(X^-.199)^ + 2 + 



¥2 



(24+28 ) - 14 



(5-5) 



Se^ = .0004559 

_ _ o 

m,p n^^ 1 



(5-6) 



(5-5) „ 

' m, p 



- . 199 + . 189 + .200 + . 212 

Xp = 14 



Xp = = -197 

V . 

(. 199 -. 197 )^ + (.189-. 197)^ (.212-. 187)^ 

_ 1 



S 



2 

m,p 



-5 



= 9.128 X 10 
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(5-9) 




n. 




(5-8) 



n. 



lfcx(5'‘5 - 



2 24^ + 28^ + 26^ 
2 24 +28 +26 



+ 30^ 
+ 50 



= 24.507 

Therefore: 

S ^ = (24. 507) (9. 128 X 10"5) = .002257 

(5-10) p = = ♦ QP ^ ? . ?I 

Se .000456 
P = 4.907 

p 

Since is associated with l4 - 1 = 15 degrees of 

p 

freedom and Se with 545 - 14 = 551 degrees of freedom, the 
P table^ gives P = I.76 at the 5 percent confidence level, 

P = 2.14 at the 1 percent confidence level. 

The P value of 2.14 at the 1 percent level Indicates 
that there is only 1 chance out of 100 that the observed 
differences between means can be explained by random errors 
if the calculated P exceeds 2.14. In this case 4.907 > 2.14, 
therefore it is reasonable to conclude that the populations 
represented by the sample means are actually different and 
the difference cannot be explained by random variation. 

Since there is an actual difference between the l4 
sample means, further testing must now be conducted to 
attempt to determine if there are sub-groupings of these 
means that are statistically homogeneous. Por this field. 



the wells v/ere liomped together by leases and an P test 
conducted to determine if the means of the three leases 
v/ere statistically the same or v/hether they were different. 



Lease 


Number of 
Measurements 


Pield 2 
Mean (X<p) 


Variance (S^^) 


1 


179 


.1948 


. 000383 


2 


71 


.1950 


. 000670 


3 


95 

345 


. 2031 


. 000641 



Por leases: 



Se^ = .000512 
= .002368 




4.62 



Degrees of freedom v/ith Se^ = 3^5 - 3 = 3^2 

p 

Degrees of freedom with S = 3 - 1=2 

Jb' 

The P table gives: 



p =5.03 at the 5 percent confidence level 
p = 4.69 at the 1 percent confidence level 

Since the calculated P, 4.62 < 4 . 69 , "the observed 
difference in the three leases means could reasonably be 
explained on the basis of the scatter of the observed data, 
and the three means are statistically the same. 

It may be concluded from the tvro P tests that the 
individual well means are statistically different, but the 
lease means are statistically the same. The well means each 




i 



i 




I 
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represent individual points in the field and since these 
points are different, heterogeneity exists between the 
various wells. As the wells are lumped together to form a 
lease mean the heterogeneity of the individual points are 
absorbed into larger somev;hat homogeneous units, these 
larger units exhibiting similar statistical characteristics. 

Modified Tukey Test 

The results of the P test on the means of the 14 vjells 
gave convincing evidence of differences among the means, 
but the P test gave no clue as to how many differences there 
v\rere. In a group of a means there are in all a(a-l)/2 
potential differences; l4(l3)/2 = 91 among the wells. Does 
each mean differ from all the rest, or are some of them 
the same? 

One method of investigating the differences is by the 
Tulcey test (modified)^. The test is made by computing a 
difference, D, which is significant at the 5 percent level, 
then comparing it I'llth the a(a-l)/2 sample differences. D 
is the product of and a factor, Q, taken from a Q table^ 
v;hich is itself computed on the basis of the distribution of 
the deviations among compared means. 



(5-11) 




and: 



( 5 - 12 ) 
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To determine Q enter the upper heading of the Q table 
v;ith the number of treatments (l4 wells) and degrees of 
freedom, f, for samples (331) indicated at the left of the 
table. Prom the table with 14 treatments and f = 351# Q 
is 4.7^. 

Por porosity, Pield 2, 

= 1.86 X 10"5 

and, 

= 4.313 X 10"^ 

Therefore: 

D = (4. 74) (4.313 X 10"^) = .02048 

The differences to be compared with D are shown in 
Table IX. 

In Table IX the X^ are arranged from high to low and each 
is subtracted from those above. Of the 91 differences, only 
15, indicated by exceed D = .0205. One inference from 
the table is that the population represented by the mean of 
v;ell 13 is different from that of wells 14, 8, 12, 4 and 11. 
Similar statements may be made about any particular \\rell. Por 
instance it may be inferred that the population represented 
by the mean of well 14 is different from that of wells I3, 7# 
6, 2, 10, and 9 but the differences between well 14 and the 
remaining v/ells are not significant. This above procedure is 
therefore useful for seeking homogeneities and dissimilarities 
among -wells. Such information could be valuable in explaining 
similarities and differences in individual v/ell production 
behavior. 



TABLE IX 

Differences Betvjeen Well Means, Modified Tukey Te 
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Sequential Method of Testing 

A sequential method of testing the differences betv/een 

7 

the means devised by Hartley is a somewhat more pov/erful 
testing procedure than the modified Tukey method. For this 
test, not one Q is taken from the Q table but several, one 
for each range of the treatment means. For the well means, 

adjacent means in the array are tested v;ith Q = 2.77 for a = 

2; for two ranks apart in the array use Q = 3 *32 for a = 3; 
for three ranks apart in the array use Q = 3-63 for a = 4; 

use Q = 4.7^ only for the extreme range where a = l4. The 

corresponding D are: 



a Q D 



2 


2.77 


.012 


3 


3.32 


.014 


4 


3.63 


.016 


5 


3.86 


.017 


6 


4.03 


.0174 


7 


4.17 


.018 


8 


4.29 


.0185 


9 


4.39 


.0189 


10 


4.47 


.0193 


11 


^.55 


.0196 


12 


4.62 


.0201 


13 


4.68 


.0202 


14 


4.74 


.0205 



These D are entered in the northeast-southwest diagonals 
of the table of differences. Table X; with the D's in 
parenthesis. Each difference is now compared with its o\-m D, 
the difference being judged significant if it is larger than 
its D. A useful rule to be observed is that if any difference 
is less than its D then no further testing needs to be done 
to the right of that difference in its row or below it in 
the column. 



TABLE X 

Differences Betv;een Well Means, Sequential Testins Method 
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Prom Table X, it is observed that there are l8 signi- 
ficant differences^, marked by an \>jhich signifies that the 
sequential method detects a greater number of differences 
than the modified Tukey method. 

Prom the table the v;ells similar to any given well may 
be detected. Por example, it may be stated that the mean of 
T\rell 9 is not significantly different from any of the other 
means except well 14. The mean porosity of viell 3 is not 
significantly different from that of any other mean except 
for well 13. 

The sequential testing method then provides a comparison 
mechanism for determining the relationship between any well 
mean desired v;ith the other v;ell means in the group. 

Homogeneity of Variances 

Several tests are available for testing the homogeneity 
of the variances of the several samples. The appropriate 
test depends upon the type of possible variation of the 
variances that is visualized. If the deviation from equality, 
i.e., = <^ 2 ^ ='.... = conceived to be caused by 

a random variation then an appropriate variance homogeneity 

O 

test is Bartlett's test° wherein the statistics B and C are 
computed v/here: 

B = ^(v In - S v^ In 



(5-13) 



^ ■ • . 3 , Q? .S2(v log^Q - z v^ losio ^i^) 





ll 



! 

■ 



f 



M 
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(5-1^) 

where: 



2(i-) - i 



C = 1 + 



3 (K-1) 



2 ^ 

V = Z and S = ^ ^ 



is the degrees of freedom associated with each sample 
2 

variance and K is the number of variances being con- 
sidered. 

The statistic B is known to satisfactorily approximate 

the Chi-square (X ) distribution corresponding to K-1 degrees 

9 /2 

of freedom.-^ A calculated B value greater than the X value 

at the given degrees of freedom and confidence coefficient 

p p 

is evidence for rejecting the hsrpothesis ^2 = 

2 

Oy. . Calculations- for porosities of Field 2 are given below 
to illustrate the method. 



TABLE XI 



Computation of Bartlett's Test of 
Homogeneity of Variance-Porosity Field 2 



Well 

No. 


(n-l) 


p 

Sl" 


log 


v^ log 


1 

^i 


''1 Si® 


1 


23 


1.983 


.29732 


6.8384 


. 044873 


45.609 


2 


27 


2.019 


.30514 


8. 2388 


.037037 


54.513 


2 


25 


1.233 


. 10323 


2.7058 


. 040000 


32.075 


A 


26 


2.551 


.39967 


10.3914 


.03846 


66. 326 


5 


17 


2.633 


.42537 


7. 2313 


.05882 


44.761 


6 


35 


8.903 


. 94954 


33.2339 


.02857 


311.605 


7 


13 


1. 292 


. 11126 


1.4464 


.07692 


16.796 


8 


22 


1.740 


. 24055 


5. 2921 


. 94546 


38. 260 


9 


33 


6.071 


.78326 


25.8476 


.03030 


200.543 


10 


19 


2. 282 


.35332- 


6.8081 


. 05263 


43.353 


11 


30 


4.023 


.60455 


18.1365 


. 03333 


120.690 


12 


18 


8.938 


.95124 


17.1223 


.05555 


160.884 


13 


14 


9.458 


. 97530 


13.6612 


. 07143 


132.412 


14 


29 


3.109 


. 49262 


14.2860 


.03448 


90.161 




331 






171.2396 


. 64788 


1357.813 



^'•Converting porosity data to percent (.199 to 19- 9) changes 
variance from .0001983 to 1.983 percent. 



K = 14 V = 331 
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(5-13) B = (v log - 2 log 



( 5 - 1^0 



2(- ) - - 



C = 1 + 



3 (lC-1) 



1 

V 



331 



= .0030211 



C = 1 -f -5^^788 - .0030211 
3 (1^-1) 

0=1+ .01^33 = 1.01^33 



, a^7,8 . a , 4.1022 



log^Q = .61302 



(5-13) B = (331) (.61302) - 171.2396) 

B = (2. 27006) (31.67) 

B = 71.98 • 

Prom table^*^ X7lth 13 degrees of freedom (l4-l) 

= 22.^ (at 5 percent level) 

= 27.7 (at 1 percent level) 
=29.8 (at 1/2 percent level) 



p 

A X 29.8 at the l/2 percent level indicates that 
there is only 1 chance out of 200 that the differences 

hetifeen the variances could he caused by random variation if 

\ 

the calculated B > 29.8. Since the B = 71.98, it may be 
concluded that the differences betv/een the variances of the 
1^1- v;ells are significant and cannot be accounted for by 
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p 

random variations. Therefore the hypothesis that cr^ = 

2 2 

cr^ = . . . . is rejected. 

Summary 

Various statistical tests that may be used to analyze 
core samples by use of the core means and variances have 
been illustrated. A core sample provides certain information 
about the population from v/hich it is extracted. The res- 
pective populations sampled by core^, core^, .... core^i;^, 
for Field 2 may be thought of as falling into one of four 
categories: 

1. The 14 samples represent identical populations, the 
apparent differences resulting from sampling error. 

2. They represent identical variances but have different 
means . 

5. They exhibit identical means but different variances. 

4. They are totally different. 

The P test indicated that the populations had different 
means. Bartlett's test shov/ed that the populations had 
different variances. Therefore we may conclude that the 
populations represented by the 14 samples are different. 

By use of the Tukey test, and the Sequential test it 
is possible to determine v;hat sub-groupings of the 14 
populations are statistically homogeneous. An indication 
that any particular v/ell is statistically the same as a 
group of other wells should prove to be a valuable tool in 
correlating the expected performance of that well with the 
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performance of the group v;ith v/hlch it may be Identified. 

In this chapter particular attention has been given to 
the distribution of sample means and of sample variances, 
yet in reality there is very little exact knowledge, based 
upon sedimentation theory, as to the appropriate distributions 
to expect. It is to be hoped that the methods of the 
present chapter v/ill be useful for quantitatively character- 
izing these distributions and thereby aid in the formulation 
of more precise theories of sedimentation. 




I' * 
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CHAPTER VI 
SAMPLING 

Introduction 

The prime purpose of core analysis is to gain some 

understanding of the characteristics of the field properties 

under consideration, namely porosity, permeability, and 

fluid saturations. As previously discussed, once sufficient 

information becomes available, it may be used to determine 

descriptive parameters for a field. The parameters may be 

expressed simply in terms of averages and deviations from 

the averages. Or more extensive parameters may be developed 

in terms of theoretical frequency curves and the moment 
2 

parameters and 6. 

Considering the initial period in the life of a field 
before an extensive amount of data becomes available, the 
question arises as to vihat inferences can be derived from 
the data as it becomes available. 

Sampling 

Sampling may be defined as the selection of part of an 
aggregate or population, on the basis of which a Judgement 
or inference about the aggregate or population is made.^ 

This sampling theory is a study of relationships existing 
beti'/een a population and samples dravm from the population. 
Statisticians have long worked within the problem of reconstruc- 
tion of a universe of variables by means of samples that 
comprise a small percentage of the universal or population 
from which the samples v;ere drax-m. Core analysis data appears 
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to comply V7ith the requisite of random sampling as stipulated 
by theoretical statistics. 

Sampling theory is useful in estimation of the unknov/n 
population quantities such as population mean, standard 
deviation, variance, etc., referred to as population para- 
meters, from a knowledge of corresponding sample quantities. 

In general a study of inferences made concerning a 
population by use of samples draxm from it together with 
indications of the accuracy of such inferences using prob- 
ability theory, is called statistical inference. 

Considering a core analysis as a random sample dravrn 
from an infinite population, the sub-surface of the earth, 
what inferences can be made about the population? Prom the 
core analysis there may be obtained a sample mean and a 
sample standard deviation where: 

(6-1) X = P 

(6-2) 

The sample mean, vxhich serves as an estimate of the 
population mean v;ill be of chief interest in considering a 
core sample. An important theorem in statistics- states 
"that for almost all populations the probability distribution 
of the sample mean based upon a simple random sample will be 
an approximately normal one if the sample size is sufficiently 
large. The standard deviation of the probability distribu- 
tion of the sample mean, denoted by is calculated by:^ 



ll6 



(6-3) 






fx 

\/n 



V7here cr^ is the standard deviation of the population being 
sampled, and n is the sample size. 

Therefore if the standard deviation of the population 
is known, may be calculated. Hov/ever, after only one 
core analysis, the standard deviation of the population is 
not ICQOim and an estimate of this standard deviation must be 
made. A point estimate of this population characteristic 
is made from the sample standard deviation, S^, and the 
sample mean is the best point estimate available for the 
population mean. 

To illustrate the application of point estimates, the 
numerical data obtained from an assumed first core analysis 
taken from Field 2, Figure 28, will be used. 

From the first core analysis, well 1, we obtain; 



^X0 " 



X 



so 



Xso 

= 



Xsw 
n = 



18.6^ 
2.98 
it 5. 8^ 

4.56 

45.3^ 

4.14, 

36 



Therefore the best estimate of the various population 
means and standard deviations are; 



S2* 

S5* 

S5» 

S6* 

S7« 



N5« 



• 



N2 • 






Nl* 



1 * 



SI* 



S4* 



SCAIiE 
1" = 400' 



N6* 



FIGURE 28 MP OP I-ffiLLS CORED— FIELD 2 
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^ 0 population 18 . 6 $^ 

X ■ = 45.85^ 

population 

^sw = ^5.3^ 

"population 



and. 



^X 0 " 
^Xso^ 
^Xsw"^ 



2.98 

4.56 

4.14 



And, ass\une that: 






X estimate 
\/n 



. 2-98 



°Xso = -75 
'^sw = -®9 



.48 



For normal probability distributions, the area under 
the normal distribution curve betv/een the mean + 2 standard 

li. 

deviations is about .95 out of a total area of 1 . Therefore 
the following limits may be constructed with a confidence 
of . 95. 

( 6 - 4 ) • X + 2 a X 

X0 + 2 (. 48 ) = 18.6 + .96 
17.64 - 19.56 

Xgo ± 2 (.t6) = 45.8 + 1.52 
44.2 - 47.4 

Xg^.j ± 2 (. 69 ) = 45.5 + 1.58 
45.9 - 46.7 
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We can conclude v;ith a . 95‘ Probability that: 

for Field 2 is somewhere between 17.64 and 19.56^. 

Xso i*or Field 2 is somewhere betiveen 44.2 and 47.4^. 

^sv; Field 2 is somev/here betv/een 45.9 and 46.7/«. 

The above conclusions are based on knov/ledge of the 
true value of V7hich v;as not the case. Statistical 
history has indicated though that if the sample size is 
reasonably large, n > 50 ^ the may be used as an estimate 
of ;^^ithout materially changing the reliability of the 
above conclusions. 

Control Charts 

From the previous section, estimates of the probable 
range of the various property means of Field 2 were deter- 
mined. The ranges ivere based upon one core analysis v^hich 
V 7 as taken from one point in the field. For the population 
which this core sample represents, the sample mean ranges 
determined should prove to be satisfactory range estimates 
of the population mean. As drilling moves away from this 
point, a different areal population may be encountered with 
different characteristics from those determined by the 
initial core sample. -In this case it vjould be expected 
that the nevi population characteristics may deviate from 
those estimated from the initial core sample. , 

In statistical quality control procedures for production 
processes, a control chart is used to indicate V7hen a process 
has changed or "gone out of control" and is no longer 



producing vjithin prescribed specifications. Basically the 
production control procedures consist of computing a process 
mean and standard deviation, constructing a control chart 
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vxithin the limits of Xprocess + 5 talcing periodic 
samples from the production line and computing the mean of 
the samples taken. If the mean falls between the limits 
set on the control chart, it is concluded that the process 
has not changed or "is in control. " If the mean does not 
fall i\rithin the limits, it is concluded that the process has 
changed or "is out of control. 

Core sampling may be considered analogous to production 
line sampling V7ith areal sampling of populations considered 
to be the analog of periodic process sampling. As core 
samples are talcen at varying distances from the initial 
sample, the means may be plotted on a control chart to indicate 
if the population- characteristics are still "under control, ", 
or if not, v;hether a nev; areal population has been encountered. 
For a control chart, +5 normally used. Thus: (from 

previous section) 



^0 ± 5 0^ = 18.6 + 5(.48) 
18.6 + 1. 44 
17.16 - 20.04 



^so ± 5 = ^5.8 + 5(.?6) 

45.8 + 2.28 

43.52 - 48.08 

^sw = ^5.5 ± 3 (.69) 




I 



45.3 ± 2.07 

45.25 - 47.57 
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Using the above, a control chart may be constructed. 
Prom Figure 28 it can be observed that the wells are located 
in a general north-south direction with well 1 located 
approximately at the center of the field. 

For this particular geographical arrangement, two 
control charts to test the hypothesis that the mean of the 
field falls within the probable range are constructed, one 
for moving in a northerly direction, one for the southerly 
direction. 



For moving North: 

% 



2 / 



20 



/Upper Control^ 
'‘Limit 20.04 ^ 



Average I 8.6 



/Lovrer Control 
'‘Limit 17.16 



17- 



) 



/«i 1 — .. . I 1 1 I ,, , I 1 L 

A// Mt a/3 vy A/e: */e 

'' Wells 



FIGURE 29 Control Chart for Average Porosity 
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The core analysis from well N1 has a of 19.2. .This 
is plotted on the control chart, and falls within the limits 
set. It is concluded that the sampling is still under 
control and the population has not changed from that indicated 
by the initial sample. As the core analysis from v;ell N2 
through No are taken, their means are plotted on Figure 29. 

Moving north from initial v;ell: 



V/ell X. 

9 



N1 


19.2 


under control 


N2 


20.5 possibly out of control 


N3 


19.9 


under control 


N4 


18.9 


under control 


N5 


19.9 


under control 


N6 


19.0 


under control 



It appears that the probable range of the field mean 
calculated from the initial core analysis gives a good 
representation of the field mean from the initial point to 
vj'ell N6. 

For moving South: 

% 




Wells 

FIGURE 30 Control Chart for Average Porosity 



Moving south from initial v/ell; 
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VJell 



SI 


18.4 


under conthol 


S2 


19.1 


under control 


S5 


20.8 


possibly out of control 


S4 


20.8 


more indication of 
out of control 


S5 


21.2 


out of control 



For movement in a southerly direction the initial 
conclusion concerning the field mean proved correct from the 
Initial point to v/ell S2. When it becomes apparent that 
the initial hypothesis concerning the mean is not tenable, 
in this case after v;ell liras drilled, a new probable range 
of the areal mean should be calculated from the data obtained 
from vrell S3 and S4. This nevr probable range would then be 
the best estimate of the population mean for the areal 
population from S4 southward. 

The adoption of the above method, vrhlch is based on 
production quality control techniques, to the analysis of 
reservoirs is one possible way in which a quantitative criterion 
could be used to segment a field into smaller homogeneous 
units. Many extensions to this procedure are possible. 

These involve the comparison of the segmentation created by 
use of the different reservoir properties. In this regard 
it is possible that the areal segments would not coincide 
vihen using different properties such as porosity, permeability, 
or fluid saturations. Hov/ever there is also no "a priori" 
reason for their being different. Similarity of areas 
iTOuld probably indicate a fairly high degree of correlation 
betv/een the variables involved. 



I 



■i 

!l 

]{ 

I 

±1 
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Confidence Interval for Population Proportion 

If a random sample size Is large, n > 30 usually being 
considered sufficientljr large, a confidence interval for the 
population proportion can be constructed in a similar way 
as for the population mean, since the probability distribu- 
tion of the sample proportion will be assvuned to be approxi- 
mately normal. 

The standard deviation of the probability distribution 

g 

of the sample proportion can be estimated from: 

(6-5) S- = ^p(l-p) 

vihere p is the sample proportion and denotes the estimate 
of the true standard deviation a—. 

The confidence interval for the population proportion 
is: 

(6-6) p + Z 

where Z is the normal deviate corresponding to the desired 

7 

confidence coefficient. ' 

To illustrate the application of confidence Intervals 
for population proportions to core analysis, suppose that 
it has been decided that an oil saturation of 30 percent or 
less is undesirable in considering the amount of pay zone in 
a field. The initial core analysis taken from Field 2 v;ill 
be used for example calculations. 

There v;ere 36 samples from v/ell 1, 6 of which were ^ 30 



percent. Thus: 
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and, 

^^nsnnun = .066 

P 55 

O 

And for a confidence coefficient of . 95 # Z is I.96, 

p ± Z 

.167 + 1.96 (.066) 

.167 + .129 

Thus it may be concluded VJith a confidence coefficient 
of 95 percent that the percentage of the field pay zone that 
is undesirable according to the predetermined criteria is 
somevThere betv/een 3*8 percent and 29.6 percent. 

Another way to interpret the use of the population 
proportion and its relation to the confidence coefficient is 
illustrated by the following modification of the above 
problem. Instead of establishing a confidence coefficient 
of 0.95 and determining the range in the percent of formation 
which vjould be unproductive, suppose that one seeks to know 
the confidence coefficient associated \ 7 ith a particular 
percentage range such as I/6 + 3 percent, or . I67 + .03. 

One then determines the value of Z corresponding to the + 

3 percent range. In this case: 

X _ 

interval in population proportion = P + Z 

A value of Z is sought such that Z(S^) = .03 or Z.= 

.03/. 066 = .455. This value of Z corresponds to a confidence 
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coefficient of .35* The interpretation is that the true 
unproductive fraction of the formation has 2 chances out of 
5 to be outside of the proportion .167 + .03 or a 1 chance 
in three (approximately 35 percent) of lying within the 
proportion range .167 + .03. 

Surnmar:/ 

T\’7o rather elementary techniques of estimating informa- 
tion about a field v;hen the amount of core analysis data is 
limited, in this case one well, have been presented. For 
Field 2, the sample mean of v/ell 1 gave good results as an 
estimation of the population mean for the areal population 
from well S2 to N6. 

The utility of these sampling techniques can only be 
confimed after being tried and tested with a large number 
of fields vxhere the actual results could be compared v;ith 
the estimated predictions. It would be conceivable to assume 
that over the long run, sampling techniques vxould provide a 
better estimate of what is to be expected than those estimates 
determined intuitively. Granted, in some cases the reservoir 
predictions by sampling techniques would prove to be in error, 
and yet the estimates could shov/ less error than the error 
or difference found by intuitive predictions. 

Sampling results should be recognized as only estimates 
and not as inf alliable "predictions. Yet the sampling 
estimates should prove better than no estimates at all. 

For any one reservoir the estimates are either right or 
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VTTong. The .95 confidence coefficient means that if the 
procedure is follovred for a long enough time over a sufficiently 
large number of reservoirs, the results should be correct 
95 percent of the time. 

It v;as the purpose of this discussion on sampling to 
indicate how certain techniques could be applied and possibly 
to stimulate the study of the applicability of other more 
advanced or specific sampling techniques. It is felt that 
the vjorlc and investigation in this area has just begun v;ith 
unlimited possibilities yet to be explored. 



CHAPTER VII 

SUI®I''JIY AI^JD CONCLUSIONS 

Introduction 

At the outset of this study tvro alternate courses of 
investigation v;ere considered for the statistical analysis 
of the field data availahle; the first, to concentrate on 
one statistical technlq^ie and analyze it in detail, and 
second, to explore various techniques vfith less detailed 
analysis of any one point. The latter course v;as chosen 
v;ith three very hroad areas included in the study, i.e., 
generalized or slcev;ed distribution curves Including trans- 
formation of data to logrithms for analysis, analysis of 
variances, and sampling procedures. 

The choice of this latter course vms prompted by the 
realization that relatively little use of theoretical 
statistics has been made in core analysis as Judged by 
available discussion in the petroleum literature. An 
attempt v;as made to present in one volimne the necessary 
calculation procedures and interpretations to serve as a 
broad guide for a statistical analysis of field data. 
Additionally, it is hoped that the study \vill serve as a 
basis for further investigation into the applicability of 
the science of statistics to reservoir problems. 

Summary 

This thesis has attenpted to indicate the applicability 
of certain statistical procedures to the analysis of core 
samples. Statistical techniques ^^^ere employed to segregate 
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relevant and meaningful information from a mass of rav; data. 

A method of detemining e.nd describing frequency distri- 
butions of the physical properties of a reservoir is presented 
in Chapter III. The Pearson family of generalized frequency 
curves provides an effective means of classifying observed 
frequency distributions v/ithout being restricted to a fev; 
classical distributions such as the normal curve, the Poisson 
curve or the binomial distribution. 

Chapter TV is devoted to the study of the effects of 
the transformation of rav; data to logrithms for analysis. 

The results obtained V7ere compared with the results of Jan 
Lav;'s^ v/orlc. 

Analysis of variances is studied in Chapter V i-rith 
various testing procedures shov/n. The P test and Bartlett's 
test are employed to test for uniformity of the population 
represented by the various v/ell samples. Tlie modified 
Tulcey and Sequential tests are used to determine v;hat sub- 
grouping of the wells are statistically homogeneous. ' 

Sampling procedures are presented in Chapter VT. 

Sample means are analyzed and presented on control charts 
by techniques analogous to production line sampling techni- 
ques. Population proportions are presented with confidence 
intervals. 

X . 

The follox-/ing listing presents a summary of the main 
equations associated with the above statistical procedures. 
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For Frequency Distributions 
Pearson Type I Curve 



( 2 - 2 ^^) 



in, in, 

Y = C(t-r^) (r^-t), ^ 



Type IV Curve 



TT 



(2-31) Y = C[(t+r)2 + S^)-™ e-'^ e''® 



Type VI Curve 
(2-5^) 



m, 



m., 



Y = C(Z (Z-a) ^ 



Normal Curve 
( 2 -^ 0 ) 



Y = C e 



2 



Type III Curve 
(2-45) 



Y = C^(A+t) 



A^-1 ^-At 
e 



The F Test 
(5-10) 

vihere: 

(5-9) 

and, 

(5-3) 



For Analysis of Variance 



F = 



Se^ 



S ^ = n 
p o m,p 



n-. 



Se^ = 



s‘^(X^-X3_)2 4- .... 

(n^ + . • • . ** 



(5-11) 



Modified Tukey and Sequential Testing 



= 



3 



(5-12) 



D = Q 
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(5-lp) 


Bartlett ' s Test 
B = i(v In - 2 In Sj_^) 


v/here: 


^11 
^ 7r “ Tr 


(5-1^) 


C = 1 + ^ 

5 (K-1) 


(6-2) 


For Sampling 

o ^z(x-x)^ 

^X ~ n 


(6-5) 


Q 

XI 

II 


1 

VJI 


s- 

^ n-1 


(6-6) 


p = + Z 



f 




I 



I 
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Conclusions 

In conj'unction v;ith the comments given in the summa]?y 

sections of the specific chapters, the major conclusions to 

be derived from the data studied are believed to be: 

1. Generalized frequency curves are useful in reflecting the 
variation of individual properties throughout a single 
depositional unit, and in characterizing and distinguish- ’ 
ing the unit from other units in the same geological 
basin. The terms and 6 should provide additional 
relevant numerical parameters for mathematical model 
analysis of a reservoir. 

2. Logrithmic transformation is occasionally useful to 
remove some skevjness from permeability data, but does 
not necessarily convert the data to the normal distribu- 
tion under all circumstances. 

3. Analysis of variances tests provide a useful tool to 
investigate homogeneity between well properties vrithin a 
reservoir or within sub areas of a reservoir. 

4. Sampling tecliniques may be utilized to estimate unlcnovrn 
population quantities from a knowledge of relatively 
small sample quantities. 

5. The science of theoretical statistics applied to reservoir 
analysis is in its infancy and appears to offer many 
opportunities for the solution of reservoir problems. 



f 
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Recommendations for Ii\?.rtlier Investigation 

T'Jhile v/orking v;ith the Pearson frequency curves, certain 
areas of further investigation appeared to offer promise: 

1. An investigation into the relationship between reservoir 
performance and the type of frequency distribution 
follov/ed by its physical properties. For example, does 
a field v:ith a Pearson Type III distribution perform 
differently than a field v/ith a Type I, other factors 
being the same? IJhat is the correlation, if any, 
beti\reen the parameters (cx^, 6) and field performance, 

the parameters o_ and 6 being considered for any physical 
property or group of properties? 

2. Sampling techniques are available for populations that 
approximate the normal frequency distribution. Could 
reliable techniques be developed for populations that 
satisfy certain types of the Pearson family of frequency 
distributions? A table of Areas Under the T 3 ^e Illg 

p 

Curve has been developed and could possibly serve as a 
point of departure for investigating sample techniques 
for core samples. 

Numerous tests have been devised for the analysis of 
variances. Certain of these tests v/ere presented in this 
thesis. Other tests may prove more useful than the ones 
presented. Further investigation into this area of statisti- 
cal analysis is highly recommended. 
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tor. In advanced mathematics it is sometimes 
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fraction. 
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VJhittaker, E. and Watson, G. , Modem Analysis , 
New York: 19^1-6, p. 253^ para. 12.4. 
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the First Kind in terms of the Gamma Function, 
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P (m, n) 



Fm Fn 
rTmTnJ 



Tlie total probability v;as set equal to unity to 
determine C. For. calculational purposes, the value 
N is placed in the numerator of the equation in 
order to account for all frequencies in a set of 
data. N = total nixmber of measurements in a set 
of data. 
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Ibid, p. 21. 
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2 
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Scarborough shov;s the method of numerical, integra- 
tion involving Gaussian coefficients v;hich is 
considered to be the best general purpose quadrature 
procedure. The method for determining I v/here: 

b 

I . / f(x) 

a 

by this technique is: 

i=n 

I = (b-a) Z R. f(Z. ) 

i=l ^ ^ 

v;here are the selected Gaussian coefficients 

and the are selected, knoT/m values of the 

independent variable x transformed to a scale 
wherein the raiige a to b becomes the range from 
0 to 1. For the present integration it \ias found 
necessary to use n = l6 for appropriate accuracy 
of the function. The "Weight Coefficients" 
for the Gauss Quadrature Formula for order l6 
were taken from Lov/an, pp. 739-7^3 • 

20. Smith, J. G. , Elementary Statistics, New York, 

193^, p. 287 . 

21. Carver, op. cit ., p. IO 5 . 

22. Craig, op. cit. , p. 27. 
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these grouping errors, knovm as Sheppard's cor- 
rections, is presented in Kendall, M. G^ The 
Advanced Theory of Statistics , London, Vol. I, 
l'9^5, P^^ 7^- Sheppard's corrections are: 

Xg = P-2 - 1/12, = li^, - 1/2 H2 + 7/2^0 

2. Kendall, M. G.,^The Advanced Theory of Statistics, 
London, 19^5, Vol. 1, p. 49. 
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4. Davis, H. T. , Tables of Higher Mathematical 
Functions, Vol. I, Bloomington, 1933# p. 179. 
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. MATHEMTICAL, ' FORTRAN, AND TECHNICAL NOTATIONS AND TERMS 



ithematlcal 


Fortran 




Notation 


Notation 


Description 


A 


A 


A parameter of the Pearson Type III 
Curve, equal to 2/ALP5 




ASQ 


Square of A 


a 




Parameter of the differential equa- 
tion (2-I); also slcevjriess of the 
distribution of a set of data defined 
by equation (2-15 )i also, a term used 
In the Sequential method of testing 
to describe the range of means In an 
array 




AJ 


Number of measurements In a particular 
class Interval 




AK 


Total number of measurements 


a 




The n^^ a term where a = 


n 




th ^ 

n moment above mean ^ cnuation 






0 

( 2 - 7 ) 


a 


ALPHA 


A parameter of the Type VI Pearson 
Curve, see equation (2-37) 




ALP5 


A basic parameter of the Pearson 
System of Frequency Curves, see 
equation (2-7A) 




■ALP3S 


Square of ALP3 


% 


ALP4 


A basic parameter of the Pearson 
System of Frequency Curves, see 
equation (2-7A) 




AN 


Number of class Intervals 




ARG 


Intermediate variable for computing 
YEAR of the type curve 




ARGl 


Intermediate variable for computing 
YEAR of the type curve 



1 



]l 



I 



p 



i 

I 



I 



1^2 



Mathematical Fortran 

Notation Notation Description 



B 



b 



C 




D 



A statistic calculated in Bartlett's 
test for variance homogeneity, see 
equation (5-15) 



An intermediate value for derivation 
of the constant for Type I curve, 
see equation (2-28) 

bo> b^, bg are terms of equation (2-2), 

vjhere f(t) is expanded in a converg- 
ing power series, values of terms are 
defined in equation (2-ll) 



BELSQ Summed BSQ between observed data and 
Normal Curve 

BSQ Equivalent to an individual 
on page ^5 for Normal Curve 



CHI 



A constant in the frequency curve for 
a particular set of data, also a 
statistic calculated in Bartlett's 
test, see equation (5-1^) 



Equivalent to an individual 



(f-F)^ 

P 



on page for Pearson Curve 



CHISQ Summed CHI betv/een observed data and 
Pearson Curve — a statistical distri- 
bution used for certain statistical 
testing 



CON 



CST 



Storage locations for all parameters 
and the constant characteristic of a 
given Pearson Type Curve 



Pearson Type 

CON(l) 

(2) 

P 

( 4 ) 

( 5 ) 





III 


IV 


EMI 


A 


R 


EM2 


C 


EM 


R 1 




S 


R2 




V 


C 




C 



The Gaussian coefficients (I6) for 
integration in determining P(R,V), 
see Ref. Chapter 5 Tor source of 

coefficients 



D An intermediate parameter for all main 

Pearson Type Curves, see equation (2-I5) 



1^3 



Mathematical Fortran 

Notation Notation Description 



D 



X 



6 






m 



m 



1 



m 



2 



For analysis of variance, a computed 
difference betv/een means that is 
significant at a chosen level, see 
equation (5-12) 

DAVG Arithmetic mean or average of data, 
see equation (6-11) 

DD(I) '"Midpoint value for class interval I 

DECM Difference between the highest value 
permissable in a given interval and 
the lowest permissable value in the 
next higher interval 

DELD Difference betv/een maximum and 

minimum value of a class interval 

DELTA A basic parameter of the Pearson 
System of Frequency Curves, see 
equation (2-12) 

DMAX Ifeximum value for data in highest 
class interval 

DMIN Minimum value for data in lov/est 
class interval 

DMP Term used to locate interval v/hich 
contains the mean 

DSQET The square root of D, see equation (2-15) 

EM A parameter in the Type IV Curve, see 
equation ( 2 - 30 ) 

EMI A parameter in Type I and VT Curves, 
see equation (2-23) 

EM2 A parameter in Type I and VI Curves, 
see equation (2-23) 

EU The independent normalized variable 

for the interval - l/2 to + 1/2 based 
upon l6 subdivisions for Gaussian 
integration. The l6 values of EU are 
taken from Ref. H Chapter 2 
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Notation Notation 
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Description 


EZ 


The term (Sin 9)^ e^® used to determine 
F(R,V), see equation (2-55) 


F 


A ratio devised by Fisher to test for 
homogeneity of means, see equation (5-10) 


fe 


2 

Degrees of freedom associated with Se , 
see equation (5-4) 


FRV 


F(R, V) function, used for constant 
of Type IV curve, see Ref. , Chapter 2 


G 


A function used in the derivation of 
the constant from Type IV curve, see 
equation (2-55) 


f G(J) 


Frequency of values in an interval 


i I 


Index, usually denoting interval number 


i 


Term to indicate a complex root in 
derivation of Type IV curve, see 
equation (2-50) 


ID 


Identification number for the field, 
well, and lease 


IDEPT 


Depth of sample, feet 


INDEX 


An index to distinguish permeability 
data cards from other data cards, 
INDEX = 1 for permeability 
INDEX = 2 for other data 


II 


An index to distinguish the first 
cycle of a loop from all subsequent 
cycles 


I PERM 


Permeability, millidarcys 


0 I PHI 


Porosity, fractional 


ISAMP 


Sample number 


ISO 

0 


Oil saturation, fractional 


00 


Infinity 


J 


Index denoting number of different 
Pearson Type Curve fits desired 
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Description 


k 


Number of variances considered in 
Bartlett's test 


L 


Identification Program — numerical 
equivalent of AN in fixed point notation 


L 


Fitting Program — index used in inte- 
^gration for YAVG 


MP 


Number of the interval counting from 
the lowest which contains the mean 


NA 


Number of intervals in fixed point 
notation 


NDEX 


An index to distinguish among porosity, 
oil saturation and water saturation 
data cards: 

NDEX = 1 for porosity 

= 2 for oil saturation 
= 5 for v;ater saturation 


^0 


An estimate of an average sample size, 
see equation (5-8) 


NTYPE 


Type of Pearson Curve to be calculated: 
NTYPE Pearson Type 

1 I 

2 III 

5 IV 

4 VI 


NUMBR 


Number of different type of curves to 
be calculated in a computer run 


PI 


An intermediate variable for the 
Type IV curve 


P 


A sample proportion, see equations 
(6-5) and (6-6) 


q PROP 


Variable for an individual physical 
property such as porosity, permeability, 
etc. 


IT 


An interval, scale for the logrithms 
of permeability data 


Q 


A factor obtained from a table to 
compute D, see equation (5-12) 
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Notation 

q 

Aq 



Fortran 

Notation 



•2 

S 



R 



R1 



R2 



Description 

Mean value of q, see equation (5-5) 

The interval of q used to construct 
the histogram on the q scale 

A parameter of the Pearson Type IV 
Curve, see equation (2-30) 

A parameter of the Pearson Type I and 
VI Curves, see equation (2-15) 

A parameter of the Pearson Type I and 
VI Curves, see equation (2-15) 

' A parameter of the Pearson Type IV 
Curve, see equation (2-30) 



a 



S 



2 



m, p 




SIGMA The standard deviation of the set of 
data, see equation (3-9) 

A term in Bartlett's test equal to 



V 

Within sample or error variance, 
see equation (5-3) 

Sample variance in Bartlett's test, 
see equations (5-13) and (5-14) 

Variance of the population of means, 
see equation (5-5) 

An estimate of the population variance, 
see equation (5-7) and (5-9) 





Standard deviation of the probability 
distribution of the sample proportion, 
see equation (6-5) 

The sample standard deviation, see 
equation (6-2) 

A sample variance, see equation (5-2) 

The standard deviation of the probability 
distribution of the sample means, see 
equation (5-II) 



1^7 



Mathematical 

Notation 


Fortran 

Notation 


1^7 

Description 


O’— 

X 




The standard deviation of the prob- 
ability distribution of the sample 
mean, see equation (6-3) 




SUM 


An intermediate storage cell used for 
the Gaussian integration in determin- 
ing p(r, v) 




SUMD 


Sum of data values in an Interval 




SUMXG(I) 


Sum for the I ' th moment about the 
midpoint of the interval containing 
the mean, see Table IV 


t 


T 


Independent variable for frequency 
distributions, in standardized nota- 
tion, see equation (3-^) 




TANT/Il 


Intermediate variable for computing 
YBAR of the Type Curve 




TEMPI 


A temporary storage cell for an 
intermediate value in the calculation 
of the Type IV curve 




TEST 


Term for determining I'^hether a data 
value is within overall interval limits 


9 


THETA 


The angle equal to tt(EU) + tt /2 for 
use in the P(RV) calculation 




TOTAL 


Total number of property samples for 
a field 




TE 


Intermediate variable for computing 
YBAR of the Type Curve 


^2 


U2 


2^ moment about the mean, see 
equation (3-3) 




U5 


pd. 

3 moment about the mean, see 

equation (3-3) 


M-4 


U4 


t h. 

4 moment about the mean, see 
equation (3-3) 


M-r 




The r moment about the mean, see 

equation (3-2) 


Hl‘ ^ 


UPl 


First moment of the histogram about 
the arbitrary chosen midpoint, see 
equation (3-6) 






I 



I 
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2 



Second moment of the histogram about 
the arbitrary chosen midpoint 









Third moment of the histogram about 
the arbitrary chosen midpoint 

UP(l) I'th moment of the data about the 

midpoint of the interval which con- 
tains the mean, see equation (5~l) 

The mode of a given frequency 
distribution 






V 



V 



V 



i 



w 



X 



i 




UM0D2 2'nd moment about the mean incorporat- 
ing Sheppard's corrections, see 
equation (5 -10) 



UM0D3 moment about the mean, incorporat- 

ing Sheppard's corrections, see 
equation (5-11 ) 



UM0D4 4'th moment about the mean, incorporat- 
ing Sheppard's corrections, see 
equation (5-12) 



V A parameter of the Pearson Type IV 
Curve, see equation (2-50) 



A term in Bartlett ' s test equal to 



Degrees of freedom associated v;ith each 
sample variance in Bartlett's test, 
see equation (5-13) 



A parameter in the derivation of the 
constant for Type I Curve, see 
equation (2-26) 



XBAR The independent variable for the 

frequency distribution wherein each 
interval has unit width 



XMP Equivalent of MP but in floating 
point notation 

2 

A term used to compute S , see 
equation (5-6) ^ 

The mean of a set of porosity data 





i 



I 




!i 



I 
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^so 


The mean of a set of oil saturation 
data 




The mean of a set of water saturation 
data 


XX 


The values of the independent variable 
for a frequency distribution correspond 
ing to the left, middle, and right 
ends of an Interval. These intervals 
are of unit width 


XZERO 


The value of XX at the left extremity 
of a class interval 


Y Y 


The frequency for a Pearson type 
curve at any value of the independent 
variable, t 


YAVG 


Integrated average value of frequency 
over a specific interval, see 
equation (3-15) 


YNORM 


A point value of frequency on the 
normal curve 


z z 


A parameter for the Pearson Type VI 
curve, see equation (2-^6) 


z 


The normal deviate corresponding to 
a desired confidence coefficient, 
see equation (6-6) 


ZERO 


A numerical constant, equal to zero 



TECHNICAL AND STATISTICAL TERMS 



150 



Term 

Array 

Chi-Square 

Class 

Frequency 

Degrees of 
Freedom 

Gamma 

Function 

Frequency 

Distribution 

Histogram 

Mean 

Median 

Mode 

Moments 

Normal 

Distribution 



Notation 



Definition 



An arrangement of raw numerical data in 
ascending or descending order of magnitude 

2 

Y A measure of the discrepancy existing 

between observed and expected frequencies 

Number of individuals belonging to each 
class in summarizing large masses of 
raw data 

d.f. The number N of independent observations 
fe in the sample (i.e‘. , the sample size) 

V minus the number K of population para- 
meters V7hich must be estimated from 
sample observations 

r(X) A transcendental function used frequently 

in statistics and defined by equation (3-12) 

A tabular arrangement of data by classes 
together V7ith the corresponding class 
frequencies 

A set of rectangles having: (a) bases 

on a horizontal axis (X axis) with 
centers at the class marks and lengths 
equal to the class interval sizes, (b) 
areas proportional to class frequencies 

X Arithmetic average of a set of data 

The middle value of a set of numbers 
arranged in order of magnitude 

[I That value of a set of numbers which 
occurs with the greatest frequency 

[X A term used in the measurement of 

dispersion of a distribution and defined 
by equation (3-2) 

Also referred to as Gaussian curve, 
normal curve of error, or normal 
probability distribution. A bell- 
shaped distribution defined by equa- 
tion (2-40) 



Term 



Notation 



Definition 
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Oil 

Saturation 


So 


A measure of the oil present v/ithin a 
rock expressed as a percentage of the 
total fluid saturation within a rock 


Permeability 


K 


A property of a porous medium and a 
measure of the capacity of the medium 
to transmit fluids. Usually measured 
in darcies or millidarcies 


Population 




Term used in statistics to refer to the 
hypothetical complete enumeration of 
facts in* a particular field of study 


Porosity 


0 


The ratio of the void space in a rock 
to the hulk volume of that rock 
multiplied hy 100 to express it in 
percent 


Range 




The difference between the largest and 
smallest numbers in a set of numbers 


Skewness 




The degree of asymmetry, or departure 
from ssmimetry, of 'a distribution, see 
equation ( 2 - 13 ) 


Standard 

Deviation 


a 


The root mean square of the deviations 
from the mean 


Standardized 

Unit 


t 


A term to express distance from mean to 
midpoint of histogram intervals in units 
of standard deviations, see equation ( 3 - 


Standardized 

Unit 


z 


A term to express the distance between 
the mean and another specified value of 
a frequency distribution as units of 
standard deviations 


Unimodial 




For frequency curves, those curves 
that have only one mode 


Variance 


a® 


The square of the standard deviation 


Water 

Saturation 


Sw 


A measure of the water present within 
a rock expressed as a percentage of the 
total fluid saturation within a rock 
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FORTRAN PROGRAMS AND PLOW CHARTS 



c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

oooo 0 
c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 

c oooo 0 



650 FORTRAN PROGRAM TO 
CALCULATE A GOOD THEORETICAL 
FIT OF A PEARSON TYPE 
FREQUENCY CURVE FOR A GIVEN 
OBSERVED DISTRIBUTION. 

THE PROCEDURE TO BE FOLLOWED 

1 TABULATE THE DATA USING 
CONVENIENT CLASS INTERVALS. 

2 CALCULATE THE MOMENTS ABOUT 
A SELECTED VERTICAL. 

3 TRANSFER THE MOMENTS TO 

THE MEAN. 

A APPLY SHEPPARDS CORRECTIONS 
TO THE MOMENTS. 

5 CALCULATE ALP3S ALPA DELTA 

6 LOCATE THE MEAN (DAVG) 

7 DETERMINE FROM THE CHART 

WHAT TYPE OF CURVE TO USE 
PROGRAM BY J S VAN SCOYOC 
UNDER THE DIRECTION OF DR. 
FLOYD PRESTON 



+ 



+ 



+ 

+ 



OOOO 0 DIMENSION DD( 100 I ,G( 100 ) ♦UPC 8 ) 

0000 1 »SUMXG( 8 ) »X ( 100) »XG( 8 ) 

0001 0 READl. INDEX. NDEX 

0030 0 READl. DMIN. DMAX. DELD.DECM + 

OOOO 0 DETERMINE NUMBER OF INTERVALS + 

OOaO 0 AN =(DMAX-DMIN) /(DELD+DECM) 

OOAl 0 IF(XCONF(D) A2.50.50 

OOA? 0 PUNCHl. AN 

0050 0 L = XFIXF (AN) 



OOOO 0 DETERMINE CENTER OF INTERVALS + 



0060 0 DD(1) = DMIN +DELD/2.0 

0061 0 IF(XC0NF(1) )62.70.70 

0062 0 PUNCHl. DD(1) 

0070 0 DO 80 I = 1. L 

0080 0 DD( 1+ 1) =DD(I) + DELD+1. 

0081 0 IF(XCONF(D) 82.90.90 

0082 0 PUNCHl. DD 

OOOO 0 COUNT D.S IN EACH INTERVAL 

OOOO 0 AND COUNT TOTAL D.S AND SUM D. 



0 0090 

0 0100 

0 0121 

0 0122 

0 0130 

0 OlAO 



0 AK=0, 

0 SUMD=0, 

0 11 = 1 

0 J = 1 

0 AJ=0, 

0 GO TO (150.180 ) . INDEX 



C OOOO 0 



READ IN DATA 
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0150 

0150 


0 

1 


READ !♦ ID» ISAMPt IDEPT» 
I PERM 


0000 


0 


CONVERT TO FLOATING POINT 


0160 


0 


DATA = FLOTF { IPERM) 


0170 


0 


GO TO 270 


0000 


0 


READ IN DATA 


0180 


0 


READ1» ID» ISAMP» IDEPT» IPHI. 


0100 


1 


IS0» ISW 


0190 


0 


GO TO( 200»220»240 ) »NDEX 


0000 


0 


CONVERT TO FLOATING POINT 


0200 


0 


DATA = FLOTF ( IPHI ) 


0210 


0 


GO TO 270 


0220 


0 


DATA = FLOTF ( ISO) 


0230 


0 


GO TO 270 


0240 


0 


DATA = FLOTF ( ISW) 


0270 


0 


IF ( DAT A-DM IN) 140 *280,280 


0280 


0 


GO T0(290,330) ,I I 


0290 


0 


II = 2 


0300 


0 


TEST = DMIN + DELD+DECM 


0330 


0 


IF (DATA-TEST) 340,380,380 


0340 


0 


SUMD = SUMD + DATA 


0360 


0 


AJ = AJ + 1 • 


0361 


0 


IFIXCONFIl ) )362,370,370 


0362 


0 


PUNCH1,SUMD,DATA,TEST,AJ, AK,J 


0362 


1 


I 


0370 


0 


GO TO 140 


0380 


0 


AK = AK + AJ 


0390 


0 


G( J) = AJ 


0395 


0 


AJ = 0, 


0400 


0 


J = J + 1 


0410 


0 


TEST = TEST + DELD+1, 


0420 


0 


IF (TEST -(DMAX+DECM ) )330,330 


0420 


1 


430 


0000 


0 


CALCULATE MEAN 


0430 


0 


DAVG = SUMD/ AK 


0431 


0 


IF(XCONF(l ) ) 43 2, 440, 440 


043 2 


0 


PUNCHl, SUMD, AK, DAVG, G 


0000 


0 


DETERMINE INTERVAL WHICH 


0000 


0 


CONTAINS MEAN 


0440 


0 


DMP = (DAVG-DMIN) /(DELD+DECM 


0450 


0 


MP = XFIXF (DMP) + 1 


0460 


0 


XMP = FLOTF (MP) 


0461 


0 


IF ( XCONF ( 1 ) )46 2,48 0,480 


0462 


0 


PUNCHl, DMP, MP, XMP 


0000 


0 


PLACE INTERVALS ON UNIT BASIS 


0480 


0 


DO 490 1=1,8 


0490 


0 


SUMXG(I) = 0. 
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0495 

0500 


0 

0 


NA=L+1 

DO 560 I = 1,NA 


0510 


0 


X( I ) = FLOTF ( I )-XMP 


0511 


0 


IF(XCONFd) )512»520»520 


0512 


0 


PUNCH1»X( I ) 


0000 


0 


CALCULATF XG TO X8G 


0520 


0 


XG( 1 ) = X( I ) * G( I ) 


0530 


0 


DO 540 J = 1 »7 


0540 


0 


XG( J+1 ) = XG( J) * X{ I ) 


0541 


0 


IF(XCONFd) )542»550»550 


0542 


0 


PUNCHlt XG 


0550 


0 


DO 560 J = !♦ 8 


0000 


0 


SUM XG TO X8G 


0560 


0 


SUMXG(J) = SUMXG(J) + XG(J) 


0561 


0 


IF(XCONFd) ) 562»570»570 


0562 


0 


PUNCHlt SUMXG 


0000 


0 


CALCULATF UPRIMES 


0570 


0 


DO 580 I = It 8 


0580 


0 


UP( I ) = SUMXG (I ) / AK 


0581 


0 


IF(XCONFd)) 582t590t590 


058? 


0 


PUNCHlt UP 


0000 


0 


CALCULATE U2 TO U8 


0590 


0 


U2 = UP( 2 )-UPd )*UPd ) 


0600 


0 


U3=UP(3 )-3.*UP(l )*UP(2)+2t* 


0600 


1 


UP( 1 )»UP(1 )*UPd ) 


0610 


0 


U4 = UP(4 )-4.*UPd )*UP(3)+6t* 


0610 


1 


UP( 1 )*UPd )*UP (2 )-3.*UP (1 )* 


0610 


2 


UP( 1 )*UPd )*UP( 1 ) 


0620 


0 


SIGMA=U2-»*.5 


0621 


0 


lE(XCONFd)) 622t630t630 


062? 


0 


■pUNCHltUltU2tU3tU4 


0000 


0 


APPLY SHEPPARDS CORRECTIONS 


0630 


0 


UM0D2=U2-1 t/12. 


0640 


0 


UMOD3=U3 


0650 


0 


UM0D4=U4-0,5*U2+7. /240. 


0651 


0 


IF(XCONEd)) 652»660t660 


065? 


0 


PUNCHl tUMOD2tUMOD3 tUM0D4 


0000 


0 


CALCULATE VALUES NEEDED 


0660 


0 


ALP3S=UMOD3*UMOD3/ (UM0D2*UM0D2 


0660 


1 


*UM0D2 ) 


0670 


0 


ALP3=ALP3S**.5 


0680 


0 


ALP4=UM0D4/ (UMOD2*UMOD2 ) 


0690 


0 


DELTA=( 2.*ALP4-3t*ALP3S-6. )/ 


0690 


1 


( ALP4+3. ) 


0760 


0 ■ 


■ dfld=deld+decm 


0770 


0 


PUNCHl tNAtN tDM IN tDMAXtDELDt 


0770 


1 


DAVG 
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0780 0 

0785 0 

0790 0 

0795 0 

0795 1 

0810 0 
0820 0 



D0790 I=1»NA 

PUNCHl* I ♦DDC I ) »X( I ) ,G( I ) 

CONTINUE 

PUNCH1,ALP3 »ALP3S»DELTA»SIGMA» + 

UP( 1 ) 

GO TO 30 
END 
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SUI€) = SUMD’ + DATA^-^ — DATA: TEST 












AK = AK + J 






0 H AJ = AJ + 1 ) < 






TEST: DMAX + DECM 



\ 1 / 
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DAVG, MP, XMP, DMP 
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G(J) = AJ 



JkL 



AJ = 0 






J = J + 1 



1=1 



TEST = TEST + DELD + 1 



\/ \/ 



SUMXG (I) = 0 



I: SUI 






1 = 1+1 



NA=I+1 



+ 1=1 



X(I)=I-XMP 



1 = 1+1 






J 






XG(1)=X(I)*G(I) 
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XG(J+1)=0CG(J)*X(I) 



1:8 f UP(I)=SUMXG(I)/AK 



1=1 



X. 



J=J+1 



SIGMA, U2,U3,U4 



^ UM0D2,UM0D3,UM0D4 



ALP5,ALP4, DELTA 



PUNCH 

NA. N. DMIN. DMAX. DELD. DAVG 



DELD = DELD + DECP 



1=1 






PUNCH I, DD(I), X(I), G(I) ^ 



1 = 1+1 



< 



PUNCH 

ALP3, ALP35, DELTA, SIGMA, UP(l) 
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PEARSON FIT FOR TYPES 1»3»4,6 
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DATA FOR READ2»READ3» AND 
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^0000 
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READ4, ARE SUPPLIED BY THE 
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0000 
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IDENT IF ICATION 
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0000 
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NUMBER (FOR READ5 » ) =NUMBER 
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0000 
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OF DIFFERENT TYPES OF CURVES 




0000 
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WHICH ARE TO BE FITTED TO THE 
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0000 
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SAME DATA 
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0000 
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NTYPE=CURVE type NUMBER 
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0000 
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NTYPE=1 FOR PEARSON TYPE 1 
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0000 
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NTYPE=2 FOR PEARSON TYPE 3 
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0000 
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NTYPE=3 FOR PEARSON TYPE 4 
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0000 
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NTYPE=4 for PEARSON TYPE 6 
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THIS PROGRAM NEEDS SINF AND 
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XCONF SUBROUTINES AND SPECIAL 
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GAMMA FUNCTION ROUTINE 
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DIMENSION DD(50) ,X(50) ,G(50) » 
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0 
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CST(16) ♦EUdS) tEZdS) ♦ 
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THETA(16 ) 


0 
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2ERO=0, 
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THETA ( I )=3.1415927#EU( I ) + 
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1.5707963 


0 


0501 
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EZ( I )=SINF(THETA( I ) ) 
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lOl 1 
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DAVG 
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AN=FLOTF(NA) 
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0 
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IF(D)8.9.9 
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0008 
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0000 0 NPCH=417 

0000 0 IF(XCONFd) )417,17»17 

0417 0 PUNCH1»NPCH,NTYPE»NUMBR»NA»D» 

0417 1 DS0RT»AN 

0017 0 GO T0( 18»30»36»18) ♦NTYPE 



0018 

0018 


0 

1 


EMl=-( ( l.+DELTA) /DELTA )*( 1.- 
ALP3/DSORT)-!. 


+ 


0019 


0 


CON(l )=EM1 




0020 


0 


EM2=-( (l.+DELTA) /DELTA )#( 1.+ 


•f 


0020 


1 


ALP3/DSORT )-l . 




0021 


0 


C0N(2 )=EM2 




0022 


0 


R1=(-ALP3+DS0RT ) /(2.*DELTA) 




0023 


0 


CON(3 )=R1 




0024 


0 


R2= (-ALP3-DS0RT ) /( 2.*DELTA) 


+ 


0025 


0 


C0N(4)=R2 




0026 


0 


GO T0(27»29»29.48) .NTYPE 




0027 


0 


C=T0TAL/(SIGMA*(R2-R1 )**( EM1+ 




0027 


1 


EM2+1.)*GAMAF(EM1+1. )*GAMAF(EM 


+ 


0027 


2 


2 + 1. ) ) 




0127 


0 


C0N(5)=C 




0028 


0 


GO TO 51 




0029 


0 


STOP 





0030 

0131 

O03 1 

0032 

0032 

0034 

0035 



0 A=2./ALP3 

0 C0N(1)=A 

0 asq=a*a 

0 C=A**(ASO)/(SIGMA*EXPEF(ASO)* 

1 GAMAF(ASO)) 

0 C0N(2)=C 

0 GO TO 51 



0036 

0037 

0038 

0039 

0040 

0041 

0042 

0043 

0043 

0044 
0146 
0000 
0000 
0000 
0000 
0502 
0502 

0046 
0246 

0047 



0 R=ALP3/(2.*DELTA) 

0 C0N(1)=R 

0 EM=l./DELTA+2. 

0 C0N(2)=EM 

0 S=DS0RT/(2.*DELTA) 

0 C0N(3)=S 

0 Pl=2.*EM-2. 

0 V=-2.*( l.+DELTA)*ALP3/(DELTA* 

1 D50RT) 

0 C0N(4)=V 

0 TEMP1=2.*EM-1 . 

0 SUM=0. 

0 DO 502' I=l»16 

0 Y=( EZ( I )**R)*EXPEF(V*THETA( I ) ) 

0 SUM=5UM+Y*CST ( I ) 

0 FRV=EXPEF(-1.5707963*V)*SUM* 

1 3.1415927 

0 C=T0TAL*S**TEMP1/(SIGMA*FRV) 

0 ' C0N( 5)=C 

0 GO TO 51 



0048 0 ALPHA=R1-R2 

0049 0 C=T0TAL*GAMAF(-EM2 )/(SIGMA* 

0049 1 GAMAF (EMl+1 . ) *GAMAF( -EM2-EM1- 

0049 2 l.)*ALPHA**(EMl+EM2+l.) ) 

0050 0 C0N(5)=C 
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0 


0051 


0 


CHISQ=0, 


0 


0000 


0 


NPCH=452 


0 


0000 


0 


IF(XCONF(l) )452»52»52 


0 


0452 


0 


PUNCH1»NPCH»C0N ' 


0 


005? 


0 


BELS0=0, 


0 


0053 


0 


XZERO=X ( 1 )-.5 


0 


0153 


0 


PROP=DMIN+DELD».5 


0 


0054 


0 


1 = 1 


^ 0 


005 5 


0 


XX(1 )=XZERO 


0 


0056 


0 


XX(2)=X( I) 


0 


0057 


0 


XX(3)=XZERO+l. 


0 


0058 


0 


L = 1 


0 


0059 


0 


XBAR=XX(L) 


0 


0060 


0 


T=(XBAR-UP1 ) /SIGMA 


0 


0061 


0 


YNORM(L) =.398942 1*EXPEF(-T*T 


0 


0061 


1 


. )*TOTAL/SIGMA 


0 


0062 


0 


GO T0(63*68»70»65) .NTYPE 


0 


0063 


0 


YBAR=(C*(T-R1 )**EM1)*( (R2-T) 


0 


0063 


1 


FM? ) 


0 


0064 


0 


GO TO 91 


0 


0065 


0 


Z= ( T-R? ) 


0 


0066 


0 


YBAR=C*Z«*EM2*( Z-ALPHA)**EM1 


0 


0067 


0 


GO TO 91 



0 0068 0 YBAR=C*(A+T)**(ASQ-1.)*EXPEF + 



0 


0068 


1 


(-A*T ) 


0 


0069 


0 


GO TO 91 
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0070 


0 


TR= ( T+R ) 


0 


0071 


0 


ARG=TR/S 


O' 


~007? 


0 


XSO=ARG*ARG 


0 


0000 


0 


C0E=3. 


0 


0073 


0 


IF(XSQ-1.)83*175.76 


0 


0175 


0 


TANM1=. 78539816 


0 


0275 


0 


GO TO 89 


0 


0076 


0 


TANMl =1.570796 3-1. /A RG 


0 


0077 


0 


0=1. 
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0078 
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D081I=1.9 
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0079 
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TANM1=TANM1+Q/ (ARG*XSQ*COE) 
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0080 
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XSQ=XSQ*ARG*ARG 
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0000 
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C0E=C0E+2. 
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0=-0 


0' 


008? 
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GO TO 89 
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0 


TANM1=ARG 
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0084 
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Q = -l. 
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DO 881=1.9 
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TANMl =TANM 1+0* ARG*XSQ/CCE 
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XSO=XSO*ARG*ARG 
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0000 
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C0E=C0E+2. 
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0088 
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0=-0 
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ARG1=V*( 1.5707963-TANMl ) 
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YBAR=C* ( (TR*TR+S*S )**(-EM )* 
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0090 
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EXPEFIARGl ) ) 
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Y(L)=YBAR 
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0000 
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IF( XCONF( 1 ) )492»92 .92 
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0492 


0 


PUNCHl »NPCH, I »L»XX »YBAR*T » 


0492 


1 


YNORM»XSO*TANMl 


0092 


0 


IF(L-3)93»95»95 


0093 


0 


L = L + 1 


0094 


0 


GO TO 59 


0095 


0 


YNAVG=( YNORM( 1 ) +YNORM ( 3 ) +4. * 


0095 


1 


YN0RM(2) )/3. 


0095 


1 


YN0RM(2) )/6. 


0096 


0 


YAVG=(Y( 1)+Y( 3)+4.*Y(2) )/6. 


0097 


0 


PUNCH1»I»X( I ) ♦PROP»G{ I ) ,Y(2) » 


0097 


1 


YAVG»YN0RM(2 ) 


0098 


0 


IF( I-NA)99,111 ,111 


0099 


0 


BSO=G( I )-YNAVG 


0100 


0 


RSQ=BSQ*BSO 


0000 


0 


IF(YNAVG) 101,102,101 


0101 


0 


BEL5Q=BFLSQ+BSQ/YNAVG 


0102 


0 


CHI=G( I )-YAVG 


0103 


0 


CHI =CHI*CHI 


0000 


0 


IF(YAVG) 104,105,104 


0104 


0 


CHI SO=CHISQ+CH I /YAVG 


0105 


0 


XZER0=XZFR0+1 . 


0106 


0 


YIl )=Y(3) 


0107 


0 


YNORM( 1 )=YN0RM(3) 


1107 


0 


PROP=PROP+DELD 


0000" 


0 


NPCH=4109 


0000 


0 


IF(XCONFd) ) 4 109, 109, 109 


4109 


0 


PUNCHl ,NPCH, I ,L »CHI ,CHISO,BSQ, 


4109 


1 


BELSO 


0109 


0 


1 = 1+1 


0110 


0 


GO TO 55 


0111 


0 


PUNCHl, ZERO 


0112 


0 


PUNCHl ,DMIN ,DMAX ,DELD,DAVG, 


0112 


1 


SIGMA, ALP3, DELTA 


0113 


0 


PUNCHl, ZERO 


0114 


0 


PUNCHl, CON, CHI SO, BELSO 


0115 


0 


IF(J-NUMBR) 116,121 ,121 


0116 


0 


J = J + 1 


0118 


0 


PAUSE 


0119 


0 


GO TO 12 


0121 


0 


PAUSE 


' 0122 


0 


GO TO 1011 


012^ 


0 


END 



■ • a 7 . • . • r ' * Y 

J . j .1 . c ' 0 ^ I 
r 4. - 

n / > ^ ) 



f ^ ' 

ii ►o^ 



•t . ^ ^ ^ ' t - ( • ' ‘ ^ ^ 'J V 

. * \ ( f ' v-ar'^v 

, \ ( ' c > ••-c'rju V 

• 3 \ ' f '•'''. ^ w ‘ ' w r n } rr- s V 

• ^ ) V . ^ I ) ■■ • . f . ' V . . . f » 

( ^ r . pv 

' 1 1 • r ; r . r :? f r ~ J i J ♦ 



•3 7’^^ 

j * "» 

r 

^ pn^» 

- 





! in--'-: - 




•n •: 


1 




r> ~ • St • -• ’ r 1 ' « 




-. r ^ 


* 


.'1 


, t • r , r r ', f { a V 1 . • 




• 1- , 








• 


r ^ ” 






• V ■• If - f I un*- I w* 


• 

V 


r ■ r* 






T f. 1 . 1 _ ~ 




r T* 




♦ ' ' 


'.'<!* f . ' r f -,' ' •. ' * 






• 


?y 


\ I l,-. +A-' 1 . -' - H T 




^ [» 


• 




^ '•4-.aj'^Y - '■•<,'-!'■> 




? 1 ** ( s 


■ 




' ( y - f ' )•/ 


,* 


r*N 






, r ' V err-.' V - * ,• ' ’ t 'X.* 


- 


^ r . 






< • <lf* f *01 * '•fc- '-'-rni 


r 


• »| r j 


*■ 




O' 


V 


oi^-=ir 


t 


* • 


f . o n r i ( ( * ) •{'• • V ' "s ' 


f 


© € ■' -7 


- 


‘•» . ^ 


- . 1 . ’ . ;/* Xl/' . •*■ ' o 




e M r , 






• 


» 


C n r jN 






r ^ ’ 






r 




r - 


' 


r r <• 


- 



.PV^ 



r 



<»S3- . i"^’ 

I • . < ' ‘ ^ • ■^ . (i ■-'-i,:;.'c: 

I ' ' . •» ' . ' >^t.- ; f 

- 1 ^ . • M '(-•IJ,-. 

! . IM' :a-',.'»l" 

’ . r . A f I .. • ' - I 

r 

• .ve 



- I [ f 

r - I r . 

^ r- , . , g 

V .ft'' 

M .' i. •> 

' • t r* / 



■ T 



r r» 



^ P ’ j Q 
* r \ n ^ 



I r ' 
f r M( 



I 



f - r r 



I 



I 



I 



I 






I 



I 






I 



• fl 



GENERALIZED FLOW CHART FOR CURVE FIT TO 
PEARSON TYPE DISTRIBUTION 



162 




ll 




ir< 




'f 

it 



♦ 



1 



165 



A 



PUNCH 

I, X(I), PROP, G(I), Y(2), YAVG, YN0RM(2) 



< 



CALCULATE CHI AND BSQ 
FOR EACH INTERVAL AND 
SUM TO GIVE CHISQ 
AND BELSQ 



Jt. 



1=1 + 1 



0 



I; NA ^ 



> 



PUNCH 

DMIN, DMAX, DELD, DAVG, 
SIGMA, ALP5, DELTA 



PUNCH 

CON, CHISQ, BELSQ 



FIGURE 52 
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I 










f. 
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b 





EXPLANATION OP PLOV/ CHART SYPTBOLS 



A Read or Punch instruction. 
Data are read or punched in the 
order given, left to right. 



An arithmetic calculation. 



( 



■ A : B 



) 



A test of the difference A - B 
for negativity, positivity, or 
zero. The program branches to 
the next appropriate calculation 
sequence depending upon the 
numerical value of this difference. 




A multi-way branch depending 
upon the numerical value of 
the inclosed index. 




A jump is made in the floiir 
chart to the circle v;ith the 
same enclosed letter. 






PIGURE 35 
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APPENDIX C 



14. 15* 17* 

57 . 

52 * 51 - 

56* 11* 15. 19* 



28 • 27 • 26 • 50 . 
10. 29. 25 . 

59 * 40 * 9 » 22 * 

38 . 

35* 54* 



55. 



7* 



12 * 

5* 

18 * 

4 * 1 ♦ 24 ■ 



8 ♦ 2 • 21 * 25 



16* 



SCALE 

1" = 800 ' 



20 * 



J 



FIGURE 34 I-IAP OP VJELLS CORED — FIELD 1 
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1 • 



2* 



4* 



5* 

3 * 



6 * 



9* 7 • 



12 * 






8 » 



\ 



11 » 



SCALE 
1" = 400' 






FIGURE 35 MAP OP l^fELLS CORED — FIELD 2 
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1 ♦ 

2 * 

3 # 4 • 

5 * 

6 * 7 * 

SCALE 

• 1 " = 400 » 



FIGURE 36 MP OP I'JELLS CORED— FIELD 3 
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